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ABSTRACT 


This  thesis  presents 


no-backup  par 

ser . 

The  pa 

programs;  it 

modifies  a 

it  becomes  a 

vali  d 

one,  a 

the  program. 

It 

au  tom 

listings  and 

is 

mach in 

currently  in 

use 

for  a 

called  SP/k. 

the  design  of  an  efficient  top-down 
rser  does  syntax-repair  of  invalid 
ry  syntactically  invalid  program  so 
nd  displays  the  modified  version  of 
atically  produces  formatted  source 
e-independent.  This  parser  is 
practical  teaching  subset  of  PL/I, 


The  parser  is  controlled  by  rabies  which  describe  the 
syntax,  formatting,  and  error-repair  for  the  language.  With 
appropriate  tables,  the  parser  can  be  used  for  the  LL(1) 
programming  languages.  The  language  to  be  processed  is 
described  by  syntax  charts,  which  are  hand  translated  to  a 
linear  syntax  language  that  is  equivalent  to  the 
corresponding  charts.  The  syntax  language  is  processed  by  a 
table  generator  to  produce  tables  for  the  parser.  The  table 
generator  itself  uses  the  parser  and  generates  its  own 
parsing  tables. 
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CHAPTER  1 


INTRODUCTION 


This  thesis  presents  the  design  of  a  table-driven  parser 
that,  automatically  repairs  syntax  errors  and  formats 
listings.  A  table  generator  that  produces  tables  to  drive 
this  parser  for  different  languages  is  also  presented. 


The  Parser  uses  a  top-down  no-backup  parse  strategy.  It 
makes  decisions  based  on  the  next  input  symbol;  it  can 
handle  grammars  that  are  LL(1)  [  Aho  and  Ullman].  The  parser 
does  automatic  formatting  of  source  text,  indenting 
logically  subordinate  code,  and  putting  each  statement  on  a 
new  line.  This  process  is  called  Ear agr aching .  The  parser 
also  does  syntax  repair  of  invalid  programs  at  the  source 
level.  That  is,  it  modifies  any  syntactically  invalid 
program  to  be  a  valid  one,  and  then  displays  the  modified 
text  for  the  user.  The  error  recovery  never  forces  backup. 


The  parser  is  currently  in  use  in  a  production  compiler 
for  a  sequence  of  subsets  of  PL/I,  called  SP/k.  This 
compiler  is  being  used  for  introductory  programming  courses 
at  the  University  of  Toronto;  it  ran  57000  jobs  in  October 
1974  [ UTCC  ] .  The  parser  is  machine-independent  and  is 
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currently  running  on  both  the  IBM  System/360  and  the  DEC 
PDP/1 1  computers. 


The 

parser  is 

table-driven.  The 

tables  encode 

the 

syntax. 

formatting. 

and  error  repair 

alternatives . 

In 

addition. 

semantic 

action  indicators 

are  also  embedded 

in 

these  tables. 

There  is  a  table  generator  which  can  process 
specifications  for  languages  that  are  LL(1)  and  meet  certain 
restrictions,  to  produce  tables  for  the  parser.  A  bootstrap 
version  of  the  table  generator  was  built  by  hand,  and  it  now 
generates  its  own  parsing  tables*  A  syntax  language  (which 
is  analogous  to  BNF)  was  invented  for  writing  specifications 
of  programming  languages.  It  is  intended  that  languages  be 
designed  using  syntax  charts;  these  charts  must  then  be 
translated  by  hand  into  the  syntax  language  to  be  machine- 
readable. 

The  rest  of  this  chapter  will  give  the  background  of 
this  thesis.  The  parser  was  originally  developed  for  the 
SP/k  language;  we  then  realized  it  could  be  put  to  wider  use 
if  language  specifications  could  be  automatically  translated 
to  tables.  The  next  few  sections  discuss  SP/k  and  the  tools 
used  to  develop  a  table  generator. 
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1 . 1  The  SP/k  Subsets  of  PL/I 


The  initial  phase  of  this  work,  namely  building  the 
compiler,  received  its  impetus  from  two  coordinated  needs. 
The  first  of  these  grew  out  of  Project  ED.  This  project 
involved  an  investigation  of  current  computer  studies 
programs  in  secondary  schools  in  Ontario,  and 
recommendations  for  materials  to  support  such  programs  for 
the  future  [Holt  and  Phillips].  Two  of  the  recommendations 
are  of  particular  interest  here.  It  was  suggested  that 
secondary  schools  should  have  their  own  computing 
facilities.  The  most  attractive  hardware  component  was  a 
minicomputer  to  be  supplied  in  each  school.  The 
corresponding  software  that  was  suggested  was  a  compiler  to 
support  a  sequence  of  structured  subsets  of  PL/I,  called 
SP/k.  This  language  sequence  also  was  important  in  that  it 
satisfied  a  second  need,  namely  to  supply  a  suitable 
teaching  tool  for  introductory  computer  courses  at  the 
university. 

SP/k  consists  of  a  sequence  of  nested  subsets  of  PL/I 
[Holt  and  Wortman  1973  ],  [Holt  and  Wortman  1  974  ].  The 
subsets  are  called  SP/1,  SP/2 ,  and  so  on.  Each  subset 
introduces  some  new  programming  language  feature  or 
features,  while  retaining  all  the  features  of  previous 
subsets.  The  subsets  contain  only  those  features  of  PL/I 
which  were  believed  to  allow  and  encourage  structured 
programming.  The  intended  pedagogical  use  of  the  sequence 
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would  involve  teaching  a  set  of  concepts,  and  then 
introducing  a  language  (subset)  which  supported  precisely 
these  concepts;  then  moving  on  to  teach  additional  concepts, 
and  presenting  an  extended  language  (larger  subset) 
supporting  these  concepts  as  well,  and  so  on. 

fit  the  present  time,  the  subsets  SP/1  through  SP/8  have 
been  designed  [ED].  The  following  list  briefly  summarizes 
the  features  introduced  in  each  subset. 


subset 

features 

SP/1 

fixed,  float,  character  string  constants  and 
expressions;  simple  output;  mathematical 
builtin  functions 

S  P/2 

variables;  declarations;  assignments;  simple 
input 

S  P/3 

comparison;  logical  expressions;  selection; 
repetition;  paragraphing 

SP/4 

character  string  expressions,  variables,  and 
builtin  functions 

SP/5 

multi-dimensional  arrays 

S  P/6 

procedures 

SP/7 

formatted  input  and  output 

SP/8 

structures;  files 

The  features  listed  here  are  common  to  many  languages.  All 
of  the  ’’dangerous"  features  of  PL/I  have  been  excluded 
[ Holt  ]. 


10 


1.2  The  SP/k  Compiler 


Programs  that  are  written  in  the  SP/k  subsets  can  be 
handled  by  most  PI/I  compilers,  but  it  was  decided  to  build 
a  compiler  that  would  combine  the  following  features: 

-  it  would  provide  fast  batching  of  small  jobs 

-  it  would  run  on  a  minicomputer 

-  i+  would  enforce  the  SP/k  restrictions  to  PL/I  by  not 

accepting  programs  outside  the  subsets 

-  it  would  do  source  level  syntax  repair. 

Accordingly,  a  compiler/interpreter  system  for  SP/k  was 
designed  and  built.  The  following  diagram  illustrates  the 
structure  of  the  SP/k  system. 


The  SP/k  compiler  performs  the  following  three 
functions:  analysis  of  the  input  program,  generation  of 
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pseudo-code,  and  interpretation  of  pseudo-code.  Each  of 
these  is  handled  in  a  separate  pass.  In  many  compilers,  the 
analysis  is  split  into  two  parts,  namely,  the  Scanner  and 
the  Parser.  Our  system  has  these,  and  in  addition  a  third 
part  called  the  Lister,  which  does  all  printing  of  source 
programs,  and  is  one  of  the  significant  parts  of  this  work. 
The  first  pass  is  referred  to  collectively  as  the 
Scanner/Parser/Lister.  The  second  and  third  functions  are 
called  the  Code  Generator  and  Interpreter,  respectively,  and 
were  written  by  other  members  of  the  project.  The  compiler 
currently  implements  SP/1  through  SP/6. 

The  Parser  was  designed  to  repair  syntax  errors  in 
programs.  The  motivation  for  this,  and  the  technique,  will 
be  discussed  in  the  body  of  this  thesis.  The  Parser 
controls  the  first  pass  and  calls  the  Scanner  and  Lister. 
It  has  a  representation  of  the  syntax  and  paragraphing  rules 
of  SP/k  in  a  table.  The  Scanner  breaks  the  input  stream 
into  tokens  (the  basic  elements  of  the  language  such  as 
keywords,  constants,  operators)  and  the  Parser  matches  these 
against  its  table  in  a  top-down  parse.  The  Parser  sends 
three  classes  of  information  to  the  Lister:  formatting 
commands,  tokens  of  the  input  program,  and  tokens  generated 
during  error  repair.  The  Lister* s  handling  of  this 
information  is  described  in  Chapter  6.  The  Parser  passes 
constants,  identifiers,  and  semantic  action  numbers  to  the 
Code  Generator.  The  System/360  version  of  the 
Scanner/Parser/Lister  uses  30000  bytes  of  code  and  a  run 
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stack  of  14000  bytes.  This  stack  includes  1000  bytes  for 
the  syntax  tables. 


1.3  Generalizing  the  Parser 

As  the  SP/k  system  was  being  built,  it  was  realized  that 
the  Scanner/Parser/Lister  could  be  used  as  a  component  in 
compilers  for  other  languages,  by  providing  appropriate 
tables  describing  those  languages.  Accordingly,  following 
completion  of  the  SP/k  compiler,  work  was  begun  on 
automating  the  generation  of  these  tables. 

The  first  step  was  to  develop  a  notation  that  could  be 
used  to  describe  programming  languages.  Syntax  charts  had 
been  used  in  the  work  on  SP/k,  and  these  were  adopted  for 
describing  other  languages.  However,  this  notation  was  not 
directly  machine-readable  so  a  corresponding  equivalent 
linear  notation,  called  the  Syntax  Language,  was  developed. 

The  final  component  of  this  research  involved  building 

» 

software  to  read  descriptions  of  programming  languages,  and 
to  produce  tables  to  initialize  the  Scanner/Pa rser /Lister 
for  processing  of  that  language. 

1.4  Using  the  SUE  System  Language 

All  of  the  programs  described  in  this  thesis  are  written 
in  the  SUE  System  Language  [Clark  and  Ham].  To  implement 
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the  SP/k  compiler  we  needed  a  language  that  was  readable, 
had  powerful  structuring  mechanisms  for  action  and  data,  and 
had  a  compiler  generating  efficient  code.  We  felt  that  SUE 
met  all  of  these  requirements.  In  addition,  it  was 
supported  by  a  great  deal  of  expertise  at  the  University  of 
Toronto . 

One  particularly  important  feature  of  SUE  is  its 
machine-independence.  There  are  SUE  compilers  that  emit 
code  for  both  the  IBM  System/360  and  the  DEC  PDP/11 
[Kalmar].  This  was  important  in  the  development  of  the 
original  SP/k  compiler  as  one  of  the  objectives  for  that 
system  was  that  it  run  on  a  minicomputer.  (It  is,  in  fact, 
currently  running  on  the  PDP/11  as  well  as  the  System/360.) 
The  table  generator  can  also  run  on  either  of  these 
machines,  although  it  will  be  used  on  the  System/360. 
However,  it  is  noteworthy  that  the  generated  parsers 
themselves  can  be  run  on  at  least  two  machines,  and  in  fact, 
on  any  machine  for  which  a  SUE  compiler  exists. 

In  general  terms,  SUE  is  much  like  Pascal  [Wirth  1973], 
and  the  particular  details  of  the  language  will  not  be 
important  in  most  of  this  thesis. 

1.5  Organization  of  the  Thesis 

In  Chapter  2  we  present  a  version  of  syntax  charts;  we 
use  this  notation  for  describing  SP/k  and  other  languages  we 
wish  to  implement.  To  use  the  Table  Generator  we  need  a 
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machine-readable  equivalent  of  charts.  The  Syntax  Language 
invented  for  this  purpose  is  presented  in  Chapter  3. 


The  next  few  chapters  present  the  Scanner/Parser/Lister 
in  detail.  The  syntax  table  that  the  Parser  uses  is  a  coded 
copy  of  a  description  in  the  Syntax  Language;  Chapter  4 
describes  how  the  Parser  interprets  the  syntax  table. 
Chapter  5  discusses  the  error  repair  technique  of  the 
Parser.  Chapter  6  presents  the  Lister  and  its  method  of 
formatting  programs. 


Chapter  7  gives  the  details 
Generator  to  produce  parsers  for 
conclusions,  and  suggestions  for 
Chapter  8. 


of  how  to  use  the  Table 
other  languages.  Our 
further  work,  are  in 
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CHAPTER  2 


SYNTAX  CHARTS 


This  chapter  presents  a  version  of  syntax  charts,  which 
are  used  for  describing  programming  languages.  Syntax 
charts  give  an  analytical  description  of  how  the  Parser  will 
recognize  the  language.  Chapter  4  will  discuss  the  Parser's 
processing  of  the  charts. 

This  chapter  gives  some  reasons  for  using  charts,  a 
brief  discussion  of  related  work,  and  an  example  which  is 
used  to  demonstrate  the  rules  for  drawing  charts. 

2. 1  Why  Use  Syntax  Charts? 

We  need  a  description  technique  for  specifying  the 
grammar  of  a  programming  language.  This  technique  must  have 
at  least  the  following  attributes  of  a  good  "programming" 
language : 

(i)  unambiguous:  statements  in  the  grammar  must  have  a 
precise  meaning 
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(ii)  powerful:  it  should  allow  descriptions  of  widely 
varying  languages  to  be  written 

(iii)  concise:  descriptions  must  not  strain  the  human 
reader’s  limit  of  comprehension. 

With  these  criteria  satisfied,  we  must  then  be  able  to  write 
descriptions  that  are  complete,  in  the  sense  that  we  want  to 
answer  the  question  "Is  the  sentence  S  in  the  language 
described?"  for  any  sentence  S. 

Backus-Naur  Form  (BNF)  [Knuth  1964]  has  been  widely  used 
as  a  description  technique,  and  although  it  achieves 
objectives  (i)  and  (ii) ,  and  to  a  large  measure  (iii)  as 
well,  we  did  not  use  it.  We  felt  that  BNF  was  not  as 
readable  as  the  notation,  called  syntax  charts,  which  will 
be  presented  in  this  chapter.  People  can  be  trained  to  read 
BNF,  but  it  is  not  an  easy  notation  to  use,  or  to 
understand.  It  was  felt  that  humans  would  be  more 
comfortable  with  a  more  graphic  type  of  presentation,  as  in 
[Wirth  1973]  ,  and  judging  by  the  experience  of  the  members 
of  Project  ED,  this  admittedly  very  subjective  opinion  would 
seem  to  be  justified. 

Humans  read  grammars  to  understand  both  the  language 
described,  and  the  compilation  process.  Sometimes  grammars 
that  are  convenient  for  describing  languages  to  people  can 
not  be  directly  used  by  parser  generators.  An  example  of 
this  is  the  XPL  system  of  [McKeeman  et  al.  ]  which  sometimes 
requires  dummy  productions  to  allow  control  to  be  passed  to 
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the  semantic  routines  at  the  appropriate  places.  This 
particular  aspect  of  the  problem  is  inherent  in  any  system 
which  allows  semantic  action  to  be  taken  only  when  some 
reduction  is  made  in  the  parse  stack.  Since  BNF  does  not 
provide  any  explicit  semantic  information,  this  is  the  usual 
way  of  handling  semantics.  As  a  result  of  this  and  similar 
restrictions,  we  often  see  two  different  representations:  a 
"reference  grammar"  and  an  "implementation  grammar"  for  the 
same  language.  It  is  easier  to  maintain  consistency  between 
documentation  and  implementation  if  both  are  based  on  the 
same  language  description.  There  is  no  need  to  write  extra 
rules  to  introduce  semantics  into  the  grammar  described  in 
our  syntax  charts,  because  semantic  actions  are  allowed  at 
almost  any  point  in  the  description.  The  charts,  which  are 
used  as  a  basis  for  generating  a  parser,  can  be  presented  in 
a  form  which  is  convenient  for  human  readers. 

The  description  that  is  presented  by  charts  consists  of 
three  classes  of  information,  viz.,  syntax,  semantics,  and 
paragraphing.  Implicitly,  the  error-repair  is  controlled  by 
the  charts  as  well.  The  discussion  in  Chapter  5  will  make 
this  clear. 


So  the  charts  do  not  just  express  "syntax"  information. 
This  makes  the  term  syntax  charts  (and  later  the  term  syntax 
language)  slightly  misleading. 
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The  next  section  mentions  some  related  work  on  charts. 


Then  an  example  language  is  presented  and  used  to  illustrate 
the  different  aspects  of  charts. 


2 . 2  Related  Work 


Syntax  graphs  have  their  basis  in  work  done  by  [Conway]. 
His  paper  essentially  defines  a  set  of  automata  that  call 
each  other,  each  graph  representing  one  machine.  Each 
machine  recognizes  a  part  of  the  input  string.  [Tixier]  did 
a  more  formal  analysis,  proving  that  the  charts  described  a 
proper  subset  of  LR  (k)  languages,  which  he  called  RCF 
(regular  context  free).  [Lomet]  points  out  that  Conway's 
original  paper  allowed  multiple  return  points  from  one 
machine  to  another,  and  that  Tixier' s  analysis  had  not 
permitted  this.  lomet* s  paper  shows  that  with  multiple 
return  points,  the  separable  diagrams  can  recognize  all 
LR  (k)  languages. 


Following  these  papers,  there  have  been  some 
implementations  that  have  been  based  on  the  theory  and 
notation  developed.  [Wirth  1973]  describes  PASCAL  in  terms 
of  syntax  graphs.  [Irons  1971]  uses  the  notation  in  his 
work,  developing  a  parser  based  on  encoded  charts.  These 
two  efforts  both  use  single-return  type  charts.  [Wilcox] 
uses  the  multiple-return  type.  Part  of  his  system  is  an 
assembler  for  a  low-level  code  describing  the  automata. 
This  is  processed  into  a  set  of  recursive  multi-exit 
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procedures,  giving  a  variation  of  the  recursive  descent 
technique  of  translation.  His  parser  is  thus  capable  of 
recognizing  LR  (k)  languages. 

Our  syntax  graphs  allow  only  one  return  point.  As 
stated  above,  a  language  must  be  LL(1)  to  be  processed  by 
our  system.  However,  not  all  LL(1)  grammars  are  acceptable. 
Restrictions  are  pointed  out  in  Section  4.7. 

2.3  An  Example  Language 

In  this  section  we  present  a  description  of  a  small 
language.  This  example  will  be  referred  to  in  later  parts 
of  this  thesis.  This  description  is  then  translated  into 
BNF  and  syntax  charts, 

2.3.1  Description 

In  the  following  description,  the  notation  {text}  means 
that  text  can  appear  zero  or  more  times. 
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A  program  is  a  statement 

A  statement  is  one  of  the  following: 

A:  identif ier=expression ; 

B:  DO  WHILE (expression) ; 
{statement} 

END; 

An  expression  is  one  of  the  following: 
A:  identifier 
B:  integer 

C:  expression+expression 
D:  expression-expression 
E:  expression | expression 
F:  expression&expression 


2.3.2  BNF 


This  BNF  is  suitable  for  a  bottom-up  parsing  compiler. 
The  production 

<while>  : :=  WHILE 

allows  the  semantic  routines  to  prepare  for  proper  handling 
of  the  expression  to  follow;  this  production  is  of  no  value 
to  the  human  reader. 
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<st  atement> 


<program>  :  :  = 

<statement>  ::=  DO  <do  statement> 

|  <assignment> 

<do  statement>  ::=  <while>  (  <expression>  );<statement  list> 

END; 

<while>  : :=  WHILE 


<statement  list> 


<statement  list><statement> 
<empty> 


<assignment>  : :=  <ident if ier>=<expression>; 


<expression> 


::=  <expression>+<primary> 
|  <expression>-<primary> 
|  <expression> | <primary> 
|  <expression>&<primary> 
j  <primary> 


<prim  ar y> 


<integer> 
<ident if ier > 


2.3.3  Charts 

Figure  2.1  gives  the  syntax  charts  for  the  above 
example. 
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2.4  The  Choice  Node 


The  discussion  of  the  next  few  sections  makes  reference 
to  the  example  just  presented. 

A.s  can  be  seen,  the  description  in  charts  is  a  set  of 
directed  graphs  with  labelled  nodes.  Different  classes  of 
information  are  represented  by  different  node  types 
(rectangle,  circle,  ellipse). 

To  allow  descriptions  of  infinite  languages  to  be 
presented,  we  need  a  selection  mechanism  which  can  pick  one 
of  several  paths  through  the  charts.  This  is  provided  by 
the  choice  node  which  looks  like  this: 

- - » 

. - » 

v - * 

There  are  two  types  of  choice  constructs.  The  first  type 
has  all  possible  successors  listed  after  the  choice  node. 

f 

The  chart  for  Primary  in  section  2.3.3  contains  a  choice  of 
this  type.  k  Primary  must  be  either  an  integer  or  an 
identifier. 

The  second  type  lis+s  some  successors,  but  then  has  an 
else  or  otherwise  clause.  This  clause  must  be  the 
physically  last  one,  and  it  begins  with 
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The  chart  for  Statement  in  section  2.3.3  contains  a  choice 
of  this  type.  If,  when  we  reach  the  choice  node,  the  next 
token  of  the  program  is  DO,  then  we  follow  the  first  branch, 
otherwise  we  follow  the  second  one.  (These  two  types  are 
handled  differently  by  the  matching  and  error  repair 
strategies  as  is  explained  in  Chapter  3.) 

2.5  Terminals  and  Non-terminals 

The  main  elements  of  the  charts  are  the  terminals  and 
non- terminals  of  the  grammar.  Terminals  correspond  directly 
to  some  token  of  the  program  to  be  parsed,  like  the  keyword 
DO.  Non-terminals  in dica te  that  some  sequence  of  nodes 
should  be  considered  as  if  it  were  at  this  point  in  the 
description.  A  r. on- ter minal  is  essentially  a  procedure  call 
(with  no  parameters)  in  the  description. 

A  production  call  is  represented  by  an  ellipse  labelled 
with  its  name,  for  example 


There  must  be  a  graph  in  the  description  which  corresponds 
to  rhis  name,  that  is,  which  has  the  name  as  label  on  its 
first  node. 

Terminals  are  represented  by  rectangles  which  may 
contain  one  of  the  following  classes  of  information: 

(i)  special  characters:  punctuation  and  operators 
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(ii)  keywords 


(ill)  constants,  or  syntactic  token  classes,  such  as 
” ident if ier M ,  which  are  recognized  by  the  Scanner 

The  following  are  examples  of  these  three  types: 


2.6  Paragraphing  Symbols 

Paragraphing  is  discussed  at  length  in  Chapter  4.  For 
the  present  section,  suffice  it  to  say  that  paragraphing 
rules  can  be  embedded  in  the  charts.  There  are  three  basic 
paragraphing  commands.  The  first  is  Indent  which  causes  the 
left  margin  to  be  set  one  increment  to  the  right.  The 
second  is  Exdent  which  causes  the  left  margin  to  be  set  one 
increment  to  the  left.  These  two  commands  do  not  imply  a 
new  line  on  the  listing,  but  simply  a  setting  of  the  left 
margin  for  the  next  line.  The  final  command  is  New  Logical 
Line  which  causes  a  new  line  to  be  started  with  the  preset 
left  margin.  These  commands  are  encoded  by  the  following 
a rrow- labelle d  circles: 


por  example,  here  is  a  program  in  the  example  language  of 
section  2.3,  formatted  according  to  the  charts: 
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DO  WHILE  (I)  ; 

1=1"  1 ; 

DO  WHILE  (J)  ; 

J=J"  1 ; 

END  ; 

END; 

2.7  Semantic  Information 

The  last  component  of  the  presentation  is  semantic 
information.  At  any  point  in  the  charts,  a  semantic  routine 
can  be  invoked.  This  is  done  by  writing  a  semantic  action 
identifier  beside  the  graph  and  drawing  a  dotted  line  from 
it  to  the  point  where  the  corresponding  action  is  indicated. 
The  meaning  is  that  when  a  program  is  being  matched  to  the 
description  (i.e.  being  parsed) ,  the  semantic  identifier 
will  be  sent  to  the  Code  Generator  when  the  indicated  point 
in  the  graph  is  reached.  For  example,  consider  the  program 

I=J+1 ; 

based  on  the  charts  of  Section  2.3.3.  The  output  from  the 
Parser  to  the  Code  Generator  would  be 

I,  J,  1,  Add,  Assign. 


This  chapter  has  presented  the  syntax  charts  as  used  in 
this  thesis.  The  next  chapter  will  give  a  linear  notation, 
suitable  for  machine  reading,  which  is  equivalent  to  these 
charts . 
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CHAPTER_3 


A  SYNTAX  LANGUAGE 


The 

notation 
are  not 
that  fit 


syntax  charts  of  the  last  c 
for  design  of  programming  langua 
directly  machine-readable.  W 
on  punched  cards  to  use  as  in 


hapter 
ges.  Ho 
e  needed 
put  to 


are  a  good 
wever,  they 
a  notation 
our  table 


generator. 


This  chapter  describes  the  Syntax  Language  developed  for 
this  purpose;  the  descriptions  produced  in  the  Syntax 
Language  are  equivalent  to  the  corresponding  charts.  This 
chapter  will  show  how  to  write  descriptions  that  can  be 
input  to  the  table  generator;  Chapter  7  will  give  the 
information  needed  to  use  our  table  generator  for 
construction  of  a  parser. 

This  chapter  shows  the  intermediate  step  in  the 
transition  from  charts  (Chapter  2)  to  the  Parser's  syntax 
table  (Chapter  4)  . 


The 


syntax  charts  for  the  Syntax  Language 


are 


in 


Here  is  an  example  of  a  chart  for  one  rule  and  its 


corresponding  Syntax  Language. 


i  oof 


(5x> 


*? — =>  END 


*1 


>  pend 
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1 LOOP_BODY 1  IS 
LABEL 

1 

CHOOSE 

*  END* ; EXDENT  NEW_LI HE  A  'LOOP_END'  RETURN  NEXT 
OTHERWISE  CALL  1  STATEMENT '  BRANCH  1  END 
END  RULE 

A  more  extensive  example  is  contained  in  section  3.6. 

3.1  Structure  and  Control  Symbols 

Each  rule  or  production  from  the  charts  is  encoded  as  a 
separate  paragraph.  The  paragraph  begins  with  the  name  of 
the  rule,  followed  by  the  keyword  IS.  The  paragraph  is 
terminated  by  the  two  successive  keywords  END  RULE.  For 
Example : 

1  STATEMENT '  IS 

•  •  • 

END  FULE 

A  ground  symbol  is  represented  by  the  keyword  RETURN  in  the 
Syntax  Language. 

The  choice  construct  begins  with  the  keyword  CHOOSE. 
Next  comes  the  list  of  possibilities.  Each  possibility. 
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except  the  physically  last  one,  ends  with  the  keyword  NEXT. 
The  last  possibility  is  terminated  by  the  keyword  END.  The 
last  possibility  may  optionally  start  with  the  keyword 
OTHERWISE.  Here  is  an  example  of  a  choice  construct: 

CHOOSE 

*  DO 1  ...  NEXT 
OTHERWISE  ...  END 

The  implications  of  choice  constructs  with  and  without 
otherwise  clauses  are  indicated  in  Section  4.7. 

Branches  are  of  the  form 

ERANCH  label 

The  "label"  must  be  an  integer  which  corresponds  to  a  label 
in  the  production. 

3.2  Terminals  and  Non-Terminals 

There  are  three  classes  of  terminals,  as  outlined  in 
Section  2.5.  In  the  Syntax  Language  they  are  specified  as 
follows: 

a)  special  characters  are  represented  by  themselves 

b)  keywords  are  encoded  as  quoted  character  strings  e.g. 

'DO' 

c)  constants  are  represented  by  the  identifier  naming 

the  constant  class,  such  as  INTEGER,  or  by  the 
appropriate  abbreviation  (see  Appendix  C) 
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A  production  call  (non-terminal)  is  encoded  by  the 
keyword  CALL  followed  by  a  quoted  character  string.  This 
string  must  be  the  name  of  a  rule  appearing  somewhere  in  the 
description. 

3.3  Specifying  Paragraphing  Rules 

In  the  Syntax  Language  the  three  paragraphing  arrows  are 
encoded  by  the  keywords  INDENT,  EXDENT,  and  NEWSLINE. 

3.4  Specifying  Semantics 

The  semantic  identifiers  which  appear  in  the  syntax 
charts  are  given  in  the  Syntax  Language  by  the  keyword  A 
(for  action) ,  followed  by  the  semantic  identifier  in  quotes. 
For  example 

A  1 LOOP_START  *  . 

3.5  Comments 

Our  Syntax  Language  can  have  comments  at  any  place  that 
a  blank  can  appear.  A  comment  is  delimited  by  /*  and  */. 

3.6  An  Example  of  Using  the  Syntax  Language 

Syntax  Language  descriptions  for  different  languages 
appear  in  the  appendices.  The  following  is  the  Syntax 
Language  description  of  the  example  in  Section  2.3. 
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'PROGRAM*  IS 

CALL  'STATEMENT'  A  ' PROGRAM_END '  RETURN  END  RULE 
'STATEMENT'  IS 
CHOOSE 

'DO*  CALL  »DO_LOOP'  RETURN  NEXT 
OTHERWISE  CALL  'ASSIGNMENT*  RETURN  END 
END  RULE 
'DO_LOOP'  IS 

'WHILE'  A  »LOOP_START' (CALL  ' EXPRESSION' ); INDENT 
NEW_LINE  CALL  'LOOP  BODY'  RETURN  END  RULE 
' LOOP_BODY'  IS 
LABEL 

1 

CHOOSE 

'END' ; EXDENT  NEW_LINE  A  'LOOP_END'  RETURN  NEXT 
OTHERWISE  CALL  'STATEMENT*  BRANCH  1  END 
END  RULE 
'ASSIGNMENT'  IS 

IDENTIEIER=CALL  'EXPRESSION'  A  » ASSIGN '; NEW_LINE 
RETURN  END  RULE 
'EXPRESSION'  IS 

CALL  'PRIMARY'  LABEL 
1 

CHOOSE 

♦  CALL  'PRIMARY'  A  'ADD'  BRANCH  1  NEXT 
-CALL  'PRIMARY'  A  'SUBTRACT*  BRANCH  1  NEXT 
| CALL  'PRIMARY'  A  'OR'  ERANCH  1  NEXT 
&CALL  'PRIMARY'  A  'AND'  BRANCH  1  NEXT 
OTHERWISE  RETURN  END 
END  RULE 
'PRIMARY'  IS 
CHOOSE 

INTEGER  RETURN  NEXT 
IDENTIFIER  RETURN  END 
END  RULE 


This  chapter  has  presented  the  Syntax  Language  which  is 
used  to  input  language  descriptions  to  the  table  generator. 
The  next  chapter  describes  the  Parser's  processing  of  each 
type  of  information  in  these  descriptions. 
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CHAPTER  4 


PROCESSING_SYNTAX_TABLES 


The  Parser  has  a  Syntax  Table  which  contains  a  coded 
copy  of  the  syntax  charts  for  the  language  to  be  parsed;  the 
Parser  is  basically  an  interpreter  with  this  table  as  its 
program.  This  chapter  gives  the  order  code  of  the  Parser, 
and  describes  the  processing  associated  with  each  type  of 
"instruction” . 


4.1  The  Parser’s  Order  Code 

The  following  list  gives  all  the  kinds  of  information 
that  can  be  in  the  Syntax  Table. 

1  -  keywords 

2  -  special  characters  (Appendix  B) 

3  -  constants  (Appendix  C) 

4  -  semantic  identifiers 

5  -  Branch 

6  -  Call  Production,  End  Production 

7  -  Choice 

8  -  Indent,  Exdent,  New  Line 
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Each  type  of  information  in  this  list  is  a  direct  analog  of 
an  element  of  the  Syntax  Language. 

The  rest  of  this  chapter  consists  of  a  brief  discussion 
of  the  processing  associated  with  each  entry  in  the  list. 

4.2  The  Scanner  and  Error  Tokens 


The  Scanner's  job  is  to  break  the  source  stream  into 
tokens;  the  Parser  invokes  the  Scanner  when  it  needs  a  token 
to  match  a)  a  terminal  or  b)  a  choice  node. 


Some  of  the  tokens  that  the  Scanner  recognizes  are  the 
constants  of  Appendix  C.  The  last  eight  of  these  are  called 
error  tokens  because  they  do  not  meet  compiler  restrictions, 
for  example,  an  identifier  that  contains  too  many 
characters.  The  Scanner  does  not  print  an  error  message 
when  one  of  these  is  detected.  Instead,  it  passes  the  error 
token  to  the  Parser,  and  the  Lister  eventually  prints  the 
message  (Section  4.2). 


Error  tokens  do  not  match  "good"  tokens  in  the  Syntax 
Table.  For  example,  the  token  BA D_ ID ENT  will  not  match  the 
token  IDENTIFIER.  This  means  that  that  if  error  tokens  are 
to  be  accepted  by  the  Parser,  they  must  be  explicitly  coded 
in  the  description. 

The  Scanner  is  the  only  part  of  the  compiler  that  knows 
about  card  boundaries;  it  does  not  allow  tokens  to  cross 
card  boundaries.  It  sometimes  detects  bad  tokens  based  on 
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this  restriction.  For  example,  if  the  closing  quotation 
mark  were  omitted  on  a  string,  the  Scanner  would  end  the 
string  at  end  of  card  and  mark  it  as  a  bad  token. 

4.3  Terminals 

This  section  deals  with  the  first  three  entries  in  the 
list  of  Section  4.1.  For  a  keyword,  a  special  character,  or 
a  constant  or  identifier,  the  Parser  invokes  the  Scanner  to 
produce  a  token.  If  this  token  is  one  that  i£  required  by 
the  tables,  then  it  is  passed  to  the  Lister  for  printing;  in 
addition,  if  the  token  is  a  constant  or  identifier,  it  is 
sent  to  the  Code  Generator.  The  Parser  then  advances  to  the 
next  entry  in  the  table.  If  the  token  is  not  the  one 
required,  then  the  mismatch  strategy  of  Chapter  5  takes 
control. 

4.4  Semantic  Identifiers 

When  the  Parser  finds  a  semantic  identifier  in  the 
Syntax  Table,  it  passes  it  to  the  Code  Generator;  it  then 
advances  to  the  next  entry  in  the  table. 

4.5  Branches 

A  branch  is  coded  as:  the  instruction  Branch,  followed 
by  the  target's  address.  Branch  addresses  are  relative  to 
the  position  in  which  they  occur  in  the  table.  If  location 
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1-1  in  the  table  is  a  Branch,  then  the  target  is  I  ♦  Syntax 
Table  (I)  . 

4.6  Calling  and  Returning  from  Productions 

The  CALL  of  the  Syntax  Language  takes  two  entries  in  the 
Syntax  Table:  Call  Production  and  the  address  of  the  called 
production.  The  address  points  to  an  absolute  location  in 
the  table;  it  is  encoded  in  order  to  use  a  small  number  of 
bits  to  represent  a  wide  range  of  addresses.  There  are  two 
constants,  named  Call  Production  Alignment  and  Call 
Production  Offset,  that  are  used  in  this- encoding .  Every 
production  starts  in  an  entry  of  the  Syntax  Table  whose 
index  is  evenly  divisible  by  Call  Production  Alignment;  so 
Call  Production  A.lignment  can  be  factored  out  of  the  stored 
address.  After  this  factoring  the  address  is  displaced  by 
Call  Production  Offset  to  take  advantage  of  the  Syntax 
Table's  storage  of  both  positive  and  negative  values;  some 
stored  production  addresses  are  actually  negative  after 
being  encoded.  So,  for  example,  if  location  1-1  in  the 
Syntax  Table  contained  the  instruction  Call  Production,  then 
the  index  of  the  called  production  would  be 

(Syntax  Table (I)  ♦  Call  Production  Offset) 

*  Call  Production  Alignment 

A  Syntax  Table  containing  2  *  (Call  Production  Offset  ♦I)  * 
Call  Production  Alignment  entries  can  be  addressed  by  this 
scheme. 
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When  a  Call  Production  is  processed  by  the  Parser,  the 
index  of  the  entry  following  the  call  address  is  pushed  onto 
a  stack;  the  Parser  then  proceeds  to  the  called  production. 

When  Return  is  encountered  in  the  table,  the  Parser  pops 
the  stack  of  addresses  and  proceeds  to  the  entry  pointed  at 
by  the  popped  address. 

4.7  Choices 

When  the  choice. node  is  reached  by  the  Parser,  it  must 
consider  several  possibilities  for  continuing,  and  select 
one  of  them.  In  the  Syntax  Table  the  Choice  is  followed  by 
a  list  of  addresses  for  the  possibilities.  These  addresses 
are  relative  indices  into  the  Syntax  Table,  like  branch 
addresses.  The  last  address  in  the  list  points  either  to  a 
possibility  beginning  with  Otherwise,  or  to  the  marker  End 
of  Choices,  which  indicates  that  this  choice  has  no 
otherwise  clause. 

For  example,  the  following  choice: 

CHOOSE 

IDENTIFIER  RETURN  NEXT 
BAD_IDENT  RETURN  END 

would  be  encoded  as  follows: 

r  ***  -  — - - - - - - - - - t 

| choose | 3 | 4 | 5 | ident | return | ba dident | return | end  choices | 

L- - - — -- - -  - - - - — -_-J 

Our  Parser  only  allows  terminals  as  the  first  node  of 
each  case  of  a  choice,  except  the  physically  last  one,  which 
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may  have 

OTHERWISE 

instead  of  a  terminal. 

When 

the  Par 

ser 

reaches  a 

choice  in 

the  tables,  it  invokes 

the 

Scanner 

to 

produce  a 

token . 

This  token  is  used 

as  a 

basis 

for 

selecting 

the  path  to  follow.  The  Parser 

tries 

to  ma 

tch 

this  token  against  the  terminals  at  the  heads  of  the  cases 
of  the  choice.  The  order  is  from  the  top  of  the  list  of 
possibilities  to  the  bottom.  If  a  match  is  found,  the 
corresponding  branch  is  followed  by  the  Parser.  If  the 
otherwise  branch  is  reached,  it  is  followed  by  the  Parser. 
If  no  match  is  found,  and  there  is  no  otherwise  branch,  then 
the  token  is  not  syntactically  legal;  the  first  branch  will 
be  taken  and  the  error  repair  strategy  of  Chapter  5  will  be 
invoked.  This  means  that  ordering  of  alternatives  is 
significant  for  error  repair  -  the  first  branch  is  taken  if 
there  is  no  otherwise. 

4 . 8  Paragraphing  Tokens 

When  the  Parser  reaches  one  of  the  paragraphing  tokens 
(Indent,  Exdent,  New  Line)  it  invokes  the  corresponding 
Lister  routine;  it  then  moves  on  to  the  next  entry. 


This  chapter  has  described  how  the  Parser  processes  the 
Syntax  Tables.  The  actions  outlined  here  work  well  when 
there  are  no  errors  in  the  program  being  parsed;  the  next 
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chapter  describes  the  special  processing  necessary  when 
syntax  errors  occur. 
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CHAPTER  5 


ERROR  REPAIR 


Many  programs  that  are  submitted  to  the  Parser  will  have 
syntax  errors.  In  fact,  in  an  educational  environment,  this 
is  probably  true  of  most  programs.  Different  compilers 
react  in  different  ways  to  syntax  errors.  Our  Parser 
repairs  the  program  to  a  syntactically  valid  one,  and 
continues  parsing.  We  use  the  term  "repair"  to  suggest  that 
the  modified  version  is  not  necessarily  what  the  programmer 
intended.  Some  authors  use  the  term  "correct"  for  the  same 
process.  In  this  chapter  we  discuss  some  of  the  reasons  for 
error  repair,  some  related  work,  and  the  particular 
technigue  that  we  have  used. 

5.1  Motivation  for  Error  Repair 

Since  it  is  clearly  more  difficult  for  the  Parser  to 
handle  errors  in  any  non-trivial  way,  rather  than  simply  to 
stop,  the  question  arises  as  to  why  it  should  be  done.  In 
fact,  many  compilers  that  are  widely  used  do  not  have  good 
error  recovery  facilities,  although  there  are  some 
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exceptions  (e.g.  PL/C  [Conway  and  Wilcox]) .  This  seems  to  be 
the  result  of  defining  the  purpose  of  a  compiler  in  a  rather 
narrow  sense  as  something  like  "to  translate  syntactically 
and  semantically  valid  programs".  Given  this  goal,  it  is 
conceivably  acceptable  to  have  error  conditions  cause 
printing  of  a  message  and  termination  of  compilation. 
However,  we  would  define  compilers  as  tools  to  help  humans 
in  the  creation  of  programs.  The  tool  should  help  the  user 
as  much  as  possible  [Horning],  while  not  introducing  any 
errors  of  its  own.  As  [Knuth  1974]  says: 

"Another  important  way  for  a  production  program  to  be 
good  is  for  it  to  interact  gracefully  with  its  users, 
especially  when  recovering  from  human  errors  in  the 
input  data." 

Syntax  repair  is  recovery  with  explicit  communication  of  its 
assumptions  to  the  user. 

Our  Parser  was  meant  to  be  used  in  an  educational 
environment.  Many  inexperienced  users  would  be  submitting 
programs  to  it;  we  wanted  to  have  the  compiler  help  make  it 
easy  to  learn  to  program.  Syntax  repair  makes  a 
modification  and  the  Parser  then  continues  processing  the 
program  so  errors  do  not  inhibit  further  checking.  Through 
the  use  of  syntax  repair,  we  hoped  to  cut  down  the  effort 
required  to  get  a  program  working  correctly.  In  addition, 
the  more  that  can  be  done  on  one  run  (within  reason) ,  the 
fewer  runs  will  be  made,  and  hence  compilation  cost  will  go 
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down.  [Conway  and  Wilcox]  point  out  that  observation  by 
users  in  different  installations  using  PL/C  indicates  that 
30  to  50  per  cent  fewer  runs  are  needed  on  a  syntax¬ 
repairing  compiler. 

One  principle  of  error-handling  we  felt  to  be  important 
is  localization  of  errors  and  their  effects.  That  is,  we 
wanted  our  Parser  to  recover  from  an  error  on  input  in  the 
space  of  a  very  few  tokens.  The  remainder  of  the  program 
could  then  be  checked  for  more  errors. 

Another  principle  was  that  the  output  caused  by  errors 
have  a  high  " signal- to- noise”  ratio.  That  is,  we  wanted  a 
little  text  to  convey  a  great  deal  of  information.  To  this 
end,  we  use  only  one  message  for  all  syntax  errors,  and  it 
occurs  at  most  once  per  line.  In  addition  to  the  message, 
we  also  display  the  repaired  source.  According  to  [Conway 
and  Wilcox]: 

"...most  effective  communication  rests  not  in  the 
specific  messages  but  in  a  reconstruction  of  the  source 
statement  that  syntactic  analysis  is  actually  passing  on 
to  semantic  analysis." 

For  example,  for  the  PL/I  input  line 

PUT  L JST (A ,B ; 

we  would  display  the  repaired  line 
PUT  LIST  (A, B)  ; 
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The  repaired  line  actually  gives  the  Parser's  assumptions  in 
terms  of  the  original  program,  which  the  user  can 
understand. 

Finally,  syntax  error  repair  allows  execution  of  some 
part  of  a  program,  even  though  errors  may  prohibit  complete 
execution.  This  can  be  accomplished  because  syntactically 
correct  programs  are  always  produced.  In  many  cases  even  if 
there  is  an  error  in  the  program  at  least  the  first  few 
lines  may  execute  and  print  the  headings,  thus  allowing 
verification  of  format.  In  this  way,  a  program  can  be 
partially  debugged  even  while  parts  of  it  contain  errors. 

An  argument  against  error  repair  is  that  students  will 
tend  to  believe  the  Parser.  That  is,  they  will  assume  that 
the  repair  is  actually  a  correction.  We  try  to  avoid  this 
by  the  wording  of  our  error  message  (which  says  that  the 
line  was  "replaced"  rather  than  corrected) .  In  addition,  in 
teaching  we  stress  that  the  machine  makes  an  attempt,  that 
will  very  often  be  wrong,  to  fix  the  program  so  that 
translation  can  continue,  and  that  the  assumptions  should  be 
checked  by  the  user. 

One  final  consideration  reflects  the  implementor's, 
rather  than  the  user's,  point  of  view.  Syntax  repair 
ensures  that  the  later  passes  of  the  compiler  need  not  check 
that  the  program  has  valid  syntax. 
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5.2  Related  Work  on  Syntax  Repair 

There  have  been  many  techniques  developed  for  handling 
errors  detected  during  parsing.  We  are  interested  primarily 
in  those  that  have  been  based  on  the  syntax  of  the  language 
being  compiled,  rather  then  on  particular  features  of  one 
specific  language.  In  this  section  we  mention  a  few 
significant  papers. 

An  early  paper,  [Irons  1963],  discusses  a  method  which 
develops  all  possible  parses  of  a  string.  An  error  is 

"V. 

detected  when  no  parses  can  be  continued  past  a  certain 
input  symbol.  At  that  point,  a  list  of  acceptable  inputs  is 
produced,  and  input  is  discarded  until  one  of  these  is 
found.  A  string  of  terminals  may  then  have  to  be  inserted 
before  the  acceptable  input.  Some  of  the  selection  details 
of  this  algorithm  are  not  made  clear.  [Leinius]  describes  a 
technique  whereby  a  potential  phrase  of  the  grammar  is 
isolated  containing  the  detected  error.  All  possible 
reductions  are  tried  until  one  is  found,  or  if  none  exists, 
a  larger  potential  phrase  is  attempted.  [James]  based  his 
approach  on  that  of  Leinius,  but  limited  the  size  of  the 
potential  phrase  to  be  considered.  He  also  used  spelling 
correction  derived  from  [Morgan].  [Graham  and  Rhodes] 
describe  another  variation.  In  their  system,  before  a 
symbol  is  stacked,  a  check  is  done  to  ensure  that  this 
stacking  would  not  invalidate  the  configuration  as  the  right 
hand  side  of  some  production  in  the  grammar,  as  this  is  an 
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error.  If  an  error  does  occur,  the  system  tries  to  do  some 
reduction  in  the  immediate  neighbourhood  of  the  error  to 
gain  information  before  trying  to  do  the  repair. 

These  methods  are  all  relatively  successful.  But  the 
scheme  that  we  adopted  is  somewhat  simpler,  and  requires 
very  little  processing.  We  also  wanted  to  have  the 
advantage  of  some  systems  that  are  particularly  adapted  to 
one  language.  [Wirth  1968]  says: 

"In  most  cases,  syntactic  errors  are  due  to  omission  or 
wrong  use  of  symbols  merely  conveying  information  about 
structural  properties  of  the  program,  such  as  commas, 
semi-colons,  and  the  various  kinds  of  brackets. 
Omission  of  elements  explicitly  denoting  program 
activity,  such  as  operands  and  operators,  is  rare.” 

He  uses  heuristics  to  handle  the  frequent  cases 
intelligently,  and  is  not  concerned  with  the  large  majority 
of  cases  which  seldom  occur.  [Conway  and  Wilcox]  also  do 
not  have  a  "general  model  or  theory"  for  handling  errors, 
but  consider  special  cases.  We  wanted  a  fast  system  of 
error  repair,  that  would  be  biased  towards  repairing  errors 
commonly  made  by  neophyte  programmers.  The  approaches  of 
[Wirth  1968]  and  [Conway  and  Wilcox]  certainly  have  that 
property,  but  we  also  wanted  to  specify  our  strategy  in 
terms  of  the  syntax  elements  of  the  language. 

All  syntax-directed  error  schemes  use  some  combination 
of  the  following  actions:  1) delete  from  parse  stack,  2) add 
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to  parse  stack,  3) discard  source,  4) generate  source.  Our 
Parser  uses  only  the  last  two  of  these  actions.  It  generates 
source  tokens,  or  deletes  source  tokens.  The  modified 
version  of  the  program  is  displayed  for  the  user  in  terms  of 
the  source  language. 

The  technique  that  our  Parser  uses  has  three  levels  of 
consideration.  The  next  section  deals  with  the  lowest 
level,  which  handles  errors  within  lines.  The  second  level, 
which  is  discussed  in  Section  5.4,  handles  the  synchronizing 
of  the  Syntax  Tables  and  the  input  stream  at  the  end  of  a 
line.  The  final  level  synchronizes  input  and  tables  at  the 
end  of  a  program;  it  is  discussed  in  Section  5.5. 

5.3  Handling  Tokens 

The  Parser  must  now  decide  how  to  treat  the  source  token 
which  does  not  match  its  tables.  Common  causes  of  mismatch 
are: 

(i)  misspelled  keyword 

(ii)  omitted  redundant  punctuation,  e.g.  parentheses 
around  MAIN  in  PL/I  procedure 

(iii)  extra  punctuation,  e.g. extra  parentheses. 

The  following  simple  strategy  attempts  to  handle  these 
common  cases  in  a  reasonable  way.  The  token-level  recovery 


strategy 
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1)  always  generates  the  next  terminal  in  the  tables 

2)  if  the  generated  token  is  a  special  character  and  the 

input  token  is  not,  then  the  input  token  is  reused. 

Otherwise  it  is  discarded  and  the  next  token  is 

read . 

The  Parser  can  only  generate  the  following  tokens: 

-  if  an  identifier  is  needed,  $NIL  is  generated 

-  if  an  integer  is  needed,  1  is  generated 

-  if  a  string  is  needed,  •?•  is  generated 

-  any  special  character 

-  any  keyword. 

This  token-level  recovery  strategy  reacts  to  the  common 

syntax  errors  listed  above  in  the  following  way.  A 

misspelled  keyword  would  mean  an  identifier  passed  by  the 
Scanner  was  mismatched  with  a  keyword  in  the  table.  The 

Parser  would  generate  and  use  the  desired  keyword,  and  then 

decide  to  discard  the  input  identifier.  This  results  in 
essentially  a  spelling  correction  of  the  keyword, 
accompanied  by  a  syntax  error  message.  This  only  works  if 
the  keyword  is  redundant.  For  example: 

PUT  LJST  (X)  ; 
would  be  repaired  to 


47 


P(JT  LIST  (X)  ; 


A  misspelling  of  a  nor. -redundant  word  like 
POO  X=1  TO  2; 

would  not  be  repaired  correctly.  (In  fact,  we  would  take 
DOO  as  an  identifier  and  construct  an  assignment  statement.) 
Spelling  correction,  as  in  [Morgan],  could  be  applied  here 
if  we  also  had  backup  in  the  Parseir.  As  soon  as  the  X  is 
found,  ve  would  back  up  and  try  the  spelling  correction  or. 
DOO. 

In  the  case  of  omitted  redundant  punctuation,  if  the 
punctuation  is  omitted  before  a  variable  or  keyword,  then 
our  scheme  will  insert  it.  However,  if  the  omitted  special 
character  is  followed  by  another  special  character,  the 
scheme  will  discard  the  following  one,  and  if  it  was 
correct,  regenerate  it  or  some  other  correct  symbol  on  the 
next  cycle,  as  there  will  be  another  mismatch  on  the  next 
token.  For  example 

X=Y / (2+SQRT  (Z)  ; 

is  repaired  to 

X= Y/  (2+SQP.T  (Z)  )  ; 

The  redundant  parentheses  are  omitted  in  the  following 
statement : 

PUT  LIST  X , Y ; 
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The  statement  is  repaired  to 


PUT  LIST  (X, Y)  ; 

Extra  punctuation  is  discarded.  For  example 
DO  WHILE  (1=2) )  ; 
is  repaired  to 

DO  WHILE  (1=2)  ; 

5.4  Localizing  Errors  to  a  Line 

The  token  level  strategy  works  well  within  lines,  but 
since  we  want  to  guarantee  that  errors  are  localized  to  one 
line,  we  introduce  a  refinement  of  the  technique.  To 
localize  errors  to  a  line  the  Parser  needs  to  have  some 
method  of  recognizing  the  end  of  a  line.  For  this  purpose,  a 
subset  of  tokens  has  been  defined  as  line  terminators.  These 
are  treated  in  a  special  way  in  a  mismatch. 

In  many  programming  languages,  and  in  particular  in 
PL/I,  many  statements  start  with  a  keyword,  or  end  with  some 
punctuation,  like  a  semi-colon.  When  a  mismatch  between  the 
source  program  and  the  Syntax  Table  of  the  Parser  occurs,  we 
can  take  advantage  of  this  fact  to  aid  in  limiting  the 
effect  of  errors  to  one  logical  line.  The  Parser  has  a  list 
of  keywords  which,  together  with  the  semi-colon,  it 
considers  to  be  line  terminators.  If  either  the  source 
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token  or  the  token  to  be  matched  in  the  tables  is  a  line 


terminator,  then  the  processing  described  in  the  previous 
section  is  not  done.  Instead  we  follow  one  of  the  courses  of 
action  outlined  below. 

If  the  source  token  is  a  line  terminator,  the  Parser 
enters  the  Generate  Line  State.  This  means  that  it  will 
generate  sufficient  tokens  (one  at  a  time)  to  match  all 
terminals  in  the  Syntax  Table  up  to  the  next  line  terminator 
in  the  table.  Then,  if  the  generated  line  terminator  is 
equal  to  the  source  token,  the  source  token  will  be  ignored, 
otherwise  it  will  be  used  again  in  the  next  attempt  to  match 
the  Syntax  Table.  However,  it  could  be  the  case  that  a 
particular  line  terminator  appears  on  input,  but  is  not 
valid  in  the  tables  for  some  distance  greater  than  that  to 
the  next  line  terminator.  That  is,  a  line  terminator  may 
appear  on  input  and  cause  the  Generate  Line  State,  and  then 
be  reused,  but  fail  to  match  the  table  again.  This  could 
occur,  for  example,  if  an  ELSE  appeared  outside  an  IF 
construct.  Here  we  have  a  line  terminator  that  will  not  be 
found  in  the  tables  even  though  we  keep  generating.  To  solve 
this  potential  infinite  loop  problem,  the  Parser  allows  an 
input  line  terminator  to  cause  the  Generate  Line  State  only 
once.  Therefore,  when  a  mismatch  occurs  on  an  input  line 
terminator,  and  that  terminator  has  already  caused  one  line 
to  be  generated,  then  the  source  token  is  discarded  and  the 
Scanner  is  invoked  to  replace  it. 
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» 

The  line  level  recovery  strategy  allows  creation  of 
completely  redundant  lines  such  as 

$NIL: PROCEDURE  OPTION S  (MAIN)  ; 
in  SP/k. 

A  different  course  of  action  is  pursued  when  the  input 
token  is  not  a  line  terminator  but  the  Syntax  Table  token  is 
a  line  terminator.  This  situation  arises  when  there  are 
superfluous  source  tokens  at  the  end  of  a  line.  In  this  case 
the  Parser  must  skip  the  extra  input  tokens,  until  it  finds 
a  line  terminator  on  input.  The  Parser  then  returns  to  its 
normal  processing  mode. 

Consider  the  following  incorrect  program  fragment: 

X=X  + ; 

X=X+1  EXTRA 

PUT  LIST  (X)  ; 

This  would  be  repaired  to: 

X=X+ 1 ; 

X=X+1 ; 

PUT  LIST  (X)  ; 

In  rhis  section  we  have  looked  at  the  line  level 
strategy.  But  we  also  need  a  way  to  detect  the  end  of  a 
program.  This  is  described  in  the  next  section. 

5.5  Handling  Program  Termination  Errors 

As  well  as  line  terminators,  there  are 
program  terminators.  These  are  the  job  control  words  such  as 
$ JOB  and  $D ATA  which  serve  to  delimit  a  program.  Program 
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terminators  trigger  the  Parser's  highest  level  of  recovery  - 
the  program  level. 

Since  program  terminators  are  not  part  of  the  syntax  of 
the  language  being  compiled,  they  do  not  appear  in  the 
Syntax  Table.  The  Parser  achieves  normal  termination  by 


determining  that  it 

has 

matched 

the 

rule  for 

the 

goal 

symbol.  However, 

these 

tokens 

do 

appear  in 

the 

input 

stream.  The  only  correct  place  for  a  program  terminator  to 
appear,  from  the  Parser's  point  of  view,  is  as  the  next 
source  token  following  the  match  of  the  goal  non-terminal. 
If,  instead,  more  source  tokens  appear  before  a  program 
terminator,  a  message  is  produced  and  they  are  skipped.  An 
alternate  strategy  at  this  point  might  be  to  discard  the 
last  symbols  read.  In  PL/I,  this  would  be  the  END;  that 
terminated  the  main  procedure,  and  probably  occurred  too 
early.  However,  this  would  only  work  for  languages  that 
have  a  distinguished  token  which  signals  the  end  of  program. 

The  other  error  that  can  occur  is  premature  input  of  a 
program  terminator,  which  means  that  the  program  text  is 
incomplete.  This  particular  error  must  be  considered  in  the 
mismatch  processing.  It  is  possible  that  we  could  have  a 
program  terminator  as  a  source  token  mismatched  against  the 
Syntax  Table.  If  this  situation  arises,  the  Parser  will  go 
into  the  Generate  Program  State.  This  state  lasts  until  the 
Parser  terminates,  having  generated  tokens  until  return  from 
the  goal  symbol.  The  Parser  is  able  to  produce  a 
syntactically  correct  remainder  for  any  program.  In  PL/I, 
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Syr.tax  Table.  If  this  situation  arises,  the  Parser  will  go 
into  the  Generate  Program  State.  This  state  lasts  until  the 
Parser  terminates,  having  generated  tokens  until  return  from 
the  goal  symbol.  The  Parser  is  able  to  produce  a 
syntactically  correct  remainder  for  any  program.  In  PL/I, 
this  usually  means  generating  END  statements  to  close  off 
groups  and  procedures. 


The  following  program  is  incomplete: 


T  :  PPOCEDUF.E  OPTIONS  (MAIN)  ; 
DECLARE  (I,  J)  FIXED; 

DO  1=1  TO  3; 

GET  LIST  (J)  ; 

This  would  be  repaired  to: 


T : PROCEDURE  OPTIONS (MAIN)  ; 

DECLA.F.E  (I,  J)  FIXED; 

DO  1=1  TO  3; 

GET  LIST  (J)  ; 

END; 

END; 

There  is  a  potential  problem  in  this  strategy.  Most 
languages  allow  an  indefinite  number  of  statements  in  a 
program.  If  this  is  the  case,  the  Parser  must  decide  to  take 
the  path  through  the  tables  that  will  lead  to  the  end  of  the 
program,  instead  of  going  into  an  infinite  loop  generating 
statements.  The  chart  for  a  statement  list  might  be  of  the 
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the  following  way:  if  the  Parser  is  in  Genera 
State  and  attempts  to  pass  through  a  choice  no 
one  of  its  options  starting  with  the  keyword  END, 
option  is  taken.  This  scheme  will  work  only  if 
END  is  the  End  of  List  terminal  in  the  lang 
compiled.  This  is  a  definite  restriction,  but  s 
us  to  handle  a  large  class  of  interesting  langua 
simple  strategy  could  be  generalized  to  allow  o 
List  symbols. 


te  Program 
de  that  has 
then  that 
the  keyword 
uage  being 
till  allows 
ges.  This 
ther  End  of 


This  chapter  and 
and  the  Parser,  and  the 
Neither  the  Scanner 
printing  is  done  by  the 
next  chapter. 


Chapter  4  have  discussed  the  Scanner 
error  repair  strategy  of  the  Parser, 
nor  the  Parser  does  any  printing;  all 
Lister  which  is  described  in  the 
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CHAPTER  6 


THE  LISTER  AND  PARAGRAPHING 


In  our  compiler  the  source  listing  is  printed  by  the 
logical  unit  called  the  Lister.  Tokens  passed  to  the  Parser 
by  the  Scanner  are  then  passed  to  the  Lister  by  the  Parser. 
The  listing  that  is  produced  is  intended  to  reflect  the 
logical  structure  of  the  program,  rather  than  the  physical 
structure  of  the  input  text.  This  chapter  mentions  some 
reasons  for  compiler-controlled  paragraphing,  and  discusses 
the  details  of  our  Lister. 

One  of  the  attractive  features  of  the  Lister  is  its 
logical  simplicity.  It  has  only  three  formatting  commands: 
Indent,  Exdent,  New  Line.  The  commands  themselves,  and 
their  interactions,  are  easy  to  understand. 

Section  6.7  contains  an  example  of  Lister  output  based 
on  the  language  of  Section  2.3.  It  demonstrates  the 
printing  of  error  messages  and  can  be  compared  with  the 
charts  to  see  how  the  paragraphing  commands  affect  the 
listi ng. 
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Here  we  give  a  sample  PL/I  program,  without  errors,  to 
demonstrate  the  kind  of  formatting  we  are  referring  to.  The 
body  of  the  DO  group,  the  procedure,  and  the  clauses  of  the 
IF  statement  are  indented. 


LIST: PROCEDUR E  OPTION S  (MAIN)  ; 

/*  THIS  PROGRAM  READS  10  NUMBERS,  PRINTS  THEIR 
/*  ABSOLUTE  VALUES  AND  THE  ALGEERAIC  SUM 
DECLARE (I, NUMBER, SUM)  FIXED; 

SUM=0 ; 

DO  1=1  TO  10; 

GET  LIST  (NUMBER)  ; 

IF  NUMBER>=0  THEN 
PUT  LIST  (NUMBER)  ; 

ELSE 

PUT  LIST (-NUMBER) ; 

SUM= SUM* NUMBER; 

END; 

PUT  LIST (’ SUM=*  ,  SUM)  ; 

END; 


6.1  Rationale  for  Paragraphing 


Most  compilers  produce  source  program  listings  with  one 
line  corresponding  to  one  card  (or  record)  of  the  input. 
The  listing  is  a  copy  of  the  input,  with  line  numbers.  Our 
Lister  prints  programs  one  token  at  a  time,  and  provides 
setting  of  tabs  and  new  lines  as  directed  by  the  Parser,  not 
by  the  record  boundaries  of  the  input*  Some  other  systems, 
such  as  IBM’s  PL/I  Checkout  Compiler  [IBM]  and  notably  the 
SUE  System  Language  [Clark  and  Ham],  provide  such  a  feature 
as  well.  However,  since  it  is  not  common  to  have  a 
paragraphing  compiler,  the  guestion  arises  as  to  why  expend 
the  effort  to  build  one,  when  a  simple  line-for-card  listing 
can  be  easily  done. 


*/ 

*/ 
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It  was  indicated  above  that  the  goal  of  the  Lister  is  to 
display  the  logical  structure  of  the  program  on  the  printed 
page.  We  have  two  degrees  of  freedom,  which  we  use  in  the 
following  way: 

-  each  statement,  or  logical  entity  goes  on  a  new  line, 

so  vertical  spacing  reflects  logical  sequence 

-  the  body  of  compound  statements  is  indented,  so 

horizontal  spacing  reflects  logical  subordination, 
all  information  at  one  logical  level  being  listed 
with  a  common  left  margin. 

Paragraphing  of  source  text  is  intended  to  aid  human 
understanding  of  programs.  It  has  been  recommended  by  some 
people  primarily  on  the  basis  of  their  own  successful 
experience.  Weissman*s  thesis  [Weissman]  provides 
experimental  data  which  indicates  that  paragraphing  reduces 
the  psychological  complexity  of  programs. 

The  aim  of  a  program  listing  is  to  convey  information 
about  the  program  to  the  human  reader,  many  times  to  someone 
other  than  the  programmer.  To  this  end  we  want  to  reduce 
the  amount  of  unfamiliar  information  on  the  page.  The  fewer 
new  things  that  are  met  with  in  a  document,  the  easier  it  is 
to  comprehend.  We  certainly  do  not  want  to  introduce  any 
unnecessary  unfamiliarity  in  the  medium  of  presentation. 
This  would  indicate  that  consistency  of  paragraphing  style 
is  very  important.  It  is  easier  to  read  a  program 
paragraphed  in  a  familiar  style  than  it  is  to  read  a  program 
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paragraphed  in  an  unfamiliar  style.  A  consistent  set  of 
paragraphing  rules  is  easily  enforced  if  the  compiler 
produces  formatted  listings.  A  consistent  paragraphing 
style  can  also  be  achieved  by  setting  standards  for  a  group 
of  people  to  follow;  but  this  is  more  difficult  to 
administer.  Different  individuals  have  different  ideas  of 
what  constitutes  a  good  set  of  paragraphing  rules;  in  the 
absence  of  evidence  to  indicate  what  paragraphing  rules  are 
better  than  others,  it  is  at  least  desirable  to  have  one  set 
of  rules  and  to  stick  to  it. 


Another  consideration  is  that  it  is  advantageous  to 
convey  to  a  programmer  what  structure  the  compiler  has 
recognized.  This  is  conveyed  in  two  ways:  syntax 
(especially  the  keyword  END)  and  paragraphing.  One  is 
completely  redundant.  This  particular  redundancy  is 
valuable  for  error  detection.  It  enables  him  to  match 
against  what  he  desired  the  structure  to  be,  and  make  any 
necessary  corrections,  such  as  inserting  END  statements. 
Some  compilers,  like  PL/C  and  IBM’s  PL/I  compilers,  use  two 
numbers  on  each  line  of  a  source  listing  in  an  attempt  to 
convey  this  information.  One  of  these  numbers  is  called 
'•level”  and  counts  the  number  of  scopes  (PROCEDURES  and 
BEGINS)  that  are  currently  open.  The  other  is  called  "nest” 
and  counts  the  number  of  DO  groups  that  are  currently  open. 
This  form  is  difficult  to  read.  Automatic  paragraphing 
gives  the  structuring  information  in  the  indentation  of 
code,  which  is  more  easily  comprehended. 
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An  advantage  of  having  the  compiler  enforce  paragraphing 
conventions  is  that  program  modifications  do  not  necessitate 
a  newly- paragraphed  deck.  Rather,  the  new  compilation 
listing  has  paragraphing  corresponding  to  the  new  structure. 
In  a  teaching  environment  it  is  difficult  to  get  students  to 
repunch  programs.  Compiler-formatted  listings  that  always 
reflect  the  program* s  logical  structure  help  to  teach  the 
value  of  visually  structuring  the  program. 

In  summary,  paragraphing  aids  human  comprehension  of 
programs,  and  there  are  benefits  to  be  derived  from 
enforcing  the  paragraphing  conventions  via  the  compiler. 

Our  Lister  provides  paragraphed  printouts.  It  is  called 
by  the  Parser  to  i) list  source  tokens,  ii) provide  horizontal 
spacing,  iii) provide  vertical  spacing,  and  iv) handle 
comments.  These  functions  are  discussed  in  the  following 
sections. 

6.2  Printing  Source  Tokens 

The  Lister  has  the  responsibility  of  printing  both 

1)  the  tokens  of  the  source  program 

2)  the  tokens  which  correspond  to  the  syntactically 

repaired  program. 

We  will  use  the  term  list  to  mean  "print  a  token  of  the 
source  program".  Every  token  of  the  source  program  is 
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listed  exactly  once.  We  will  use  the  term  save  in  the 
following  sense.  Any  token  of  the  syntactically  repaired 
program  is  saved  for  use  in  generating  code;  the  tokens  that 
are  saved  either  are  generated  by  the  Parser  or  are  source 
tokens.  Putting  this  another  way,  the  source  program  is 
listed,  but  the  repaired  program  is  saved . 

The  Parser  instructs  the  Lister  to  print  the  source 
program  by  calling  one  of  the  following  three  routines: 

1)  List  and  Save  Token:  This  routine  is  called  when  a 

source  token  is  syntactically  valid,  and  is  to  be 

used  in  the  translated  version  of  the  program.  The 

« 

token  is  printed  in  the  source  line,  and  is  saved 
in  the  repaired  line. 

2)  List  and  Do  Not  Save  Token:  This  routine  is  called  by 

the  Parser  when  a  source  token  is  syntactically 
invalid,  and  is  being  discarded.  The  token  is 
printed  even  though  it  is  not  used  in  the 
translated  version. 

3)  Save  Token:  This  routine  is  called  when  the  Parser 

has  generated  a  token.  Since  the  token  did  not 
appear  on  input,  it  will  not  be  listed,  but  it  is 
saved  as  part  of  the  valid  program. 

When  a  syntax  error  occurs,  the  Lister  displays  a  copy 
of  the  repaired  line  after  the  source  text  line  in  error. 
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The  source  line  contains  all  listed  tokens;  the  repaired 
line  contains  all  saved  tokens. 

An  error  flag  is  inserted  into  a  source  line  at  the 
point  of  detection  of  the  first  error  on  a  physical  line  of 
text . 

These  Lister  routines  also  handle  the  printing  of  error 
messages  for  the  "bad”  tokens  mentioned  in  Section  4.2. 
(These  are  the  last  eight  constant  types  in  Appendix  C.) 
After  printing  one  of  these  tokens,  the  Lister  invokes  the 
procedure  Put  Scanner  Error  Message.  This  procedure  causes 
the  partially-filled  line  containing  the  error  to  be 
printed,  followed  by  an  error  message.  Then  the  next 
physical  line  is  filled  with  blanks  over  to  where  the  error 
token  ended,  and  normal  processing  continues.  The  message 
gets  printed  but  the  Lister  logic  is  not  affected.  As  far 
as  it  knows,  nothing  has  happened  to  the  line  of  text.  Thus 
these  errors,  which  occur  relatively  infrequently,  are 
handled  locally  by  the  Lister.  An  example  of  one  of  these 
errors  is  given  in  Section  6.7. 

6.2.1  Formatting  Within  Lines 

Just  as  we  need  rules  for  indenting  lines  and  vertical 
spacing,  so  we  must  consider  the  placing  of  tokens  on  a 
line.  Some  token  pairs  cannot  be  juxtaposed  without 
intervening  blanks.  For  example  two  identifiers,  such  as  I 
and  J,  if  juxtaposed  would  look  like  one  identifier,  IJ. 
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Other  tokens,  such  as  parentheses,  do  not  need  to  be 
separated  from  the  tokens  before  or  after  them.  Our  Lister 
uses  this  very  simple  strategy:  when  a  token  is  to  be  placed 
on  a  line,  if  neither  that  token  nor  the  previous  one  are 
special  characters  (Appendix  B) ,  then  a  blank  is  inserted 
before  the  token,  otherwise  it  is  placed  in  immediate 
juxtaposition  to  the  previous  token.  This  means  that  two 
identifiers  would  be  separated  by  a  blank,  but  parentheses 
would  have  no  blanks  on  either  side.  (This  rule  can  be 
changed  by  minor  modifications  to  the  Lister.) 

6.3  Logical  Lines  and  Physical  Lines 

i 

The  Parser  wants  to  list  all  tokens  that  it  encounters 
between  two  New  Line  commands  in  its  tables  on  the  same 
line.  This  is  called  a  logical  line.  The  Parser  ignores 
the  fact  that  the  length  of  physical  lines  is  bounded,  and 
that  a  logical  line  may  not  fit  on  the  printer.  It  can 
behave  as  if  the  page  were  wide  enough  to  take  any  logical 
line . 


Since  any  device  used  for  a  printer  has  a  finite  line 
size,  we  also  need  another  concept,  physical  lines  .  These 
are  the  actual  records  which  are  printed.  Several  physical 
lines  may  correspond  to  one  logical  line;  each  logical  line 
is  represented  by  at  least  one  physical  line.  In  other 
words,  records  are  printed  for  one  of  two  reasons: 
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1)  The  Parser  encounters  a  New  Line  symbol  in  its  Syntax 

Table  and  thus  determines  that  a  logical  line  is 
complete.  As  a  result,  it  sends  a  New  Line  command 
to  the  Lister. 

2)  Due  to  the  limited  length  of  the  physical  print  line, 

no  more  tokens  of  the  present  logical  line  will  fit 
on  the  current  physical  line. 

In  either  case,  if  a  syntax  error  occurred  on  the  physical 
line  just  completed,  a  message,  and  the  repaired  text,  are 
printed  following  the  source  text.  (The  example  at  the  end 
of  this  chapter  demonstrates  this.)  Case  1)  is  handled  by 
the  Lister  routine  named  New  Logical  Line.  It  prints  a  line 
number  and  tabs  over  to  the  current  margin  position.  Case 
2)  causes  the  invocation  of  the  procedure  New  Physical  Line 
from  inside  the  Lister.  The  Lister  would  have  been  invoked 
through  one  of  the  entry  points  named  in  the  last  section. 
The  continued  portion  of  the  logical  line  is  indented  one 
level  more  than  is  current,  if  the  line  has  not  already  been 
continued.  No  line  number  is  assigned  to  continuation 
lines . 

Very  long  tokens,  such  as  a  character  string  that  fills 
a  card,  cause  a  special  problem.  If  we  are  using  a 
teletype,  a  long  token  will  not  fit  on  one  line.  If  this 
happens,  then  case  2)  of  the  previous  paragraph  will  hold, 
and  the  routine  called  New  Physical  Line  will  be  invoked. 
This  procedure  checks  the  token  that  caused  its  invocation 


63 


to  see  if  it  is  too  large  to  fit  on  a  print  line,  and  if  so, 
prints  as  much  of  it  as  will  fit  on  a  completely  exdented 
line.  It  then  moves  the  remaining  characters  to  the  front  of 
the  buffer,  leaving  them  to  be  handled  by  the  normal  logic. 
It  also  sets  up  for  another  physical  line.  (It  is  always 
true  that  a  long  token  will  fit  on  one  exdented  line  plus 
one  indented  line.) 

One  other  problem  that  is  handled  by  the  New  Physical 
Line  routine  arises  when  the  token  forcing  a  new  line  is  too 
long  for  the  indented  line,  but  will  fit  on  an  exdented 
line.  This  problem  is  handled  by  temporarily  leaving  the 
left  margin  at  the  extreme  left  of  the  page. 

6.4  Tab  Setting 

The  Lister  has  two  entry  points,  called  Indent  and 
Exdent,  which  are  used  for  changing  the  left  hand  margin  of 
the  printed  text.  These  entry  points  are  used  by  the  Parser 
when  it  encounters  the  symbols  Indent  or  Exdent, 
respectively,  in  its  Syntax  Table. 

A.  Tab  Increment  is  defined  (currently  set  at  three 
spaces) ,  and  each  Indent  causes  setting  of  the  left  margin 
one  increment  to  the  right.  Exdent  correspondingly  causes 
it  to  be  set  one  increment  to  the  left.  Since  there  really 
is  a  finite  maximum  page  width,  and  since  we  must  always 
have  some  room  for  printing,  there  is  also  a  maximum 
indentation.  When  this  is  reached,  no  further  indentation 
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is  performed,  but  rather  the  Indent  commands  are  simply 


counted.  All  lines  and  continuations  are  listed  at  this 
maximum  indentation  level  until  there  are  sufficient  Exdent 


commands 

to 

return  the 

logical 

margin 

to  the  physical 

margin. 

At 

this  time 

physical 

margin 

changes  start 

occurring 

as 

before. 

6.5  Listing  Comments 

The  mechanism  that  has  been  outlined  so  far  in  this 
chapter  is  used  for  all  of  the  source  program  except  the 
comments.  They  are  not  handled  by  the  Parser,  and  never 
occur  in  the  Syntax  Tables.  Instead,  there  is  a  small  piece 
of  code  at  the  interface  between  the  Scanner  and  Parser 
which  invokes  the  Lister  routine  List  Comment  whenever  the 
Scanner  produces  a  comment. 

This  routine  handles  comments  in  one  of  two  ways: 

1)  a  comment  occurring  immediately  after  a  New  Line  in 

the  tables  will  be  listed  on  a  line  by  itself,  at 
the  same  indentation  level  that  the  next  line  of 
source  will  be  listed  at. 

2)  a  comment  embedded  in  a  line  will  be  listed  in  the 

text  of  that  line. 

List  Comment  examines  the  line  that  is  currently  being 
built  for  the  printer.  If  the  comment  is  the  first  thing  to 
go  into  the  line,  then  it  is  listed  at  the  current 
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indentation  level  with  no  line  number,  and  a  line  number  is 
provided  for  the  next  line  to  be  built.  The  comment  is  the 
only  source  token  to  go  on  the  line.  If,  on  the  other  hand, 
the  current  line  already  has  some  tokens  on  it,  then  the 
Lister’s  entry  point  List  and  Save  Token  is  given  control  to 
handle  the  comment.  Control  returns  to  the  interface 
between  the  Scanner  and  Parser,  and  the  Scanner  is  invoked 
for  another  token.  The  Parser  has  not  seen  the  comment. 


When  the  prototype  version  of  the  Scanner/Parser/Lister 
was  built,  the  comment  listing  strategy  was  considerably 
more  complex.  The  Parser  of  that  version  would  invoke  the 
Scanner  as  soon  as  it  had  handled  a  source  token,  thus 
ensuring  that  there  was  always  a  token  in  its  buffer.  Since 
we  wanted  to  list  comments  at  the  same  level  of  indentation 
as  the  next  line  of  source  code,  this  meant  that  the  List 
Comment  logic  had  to  look  ahead  through  the  Syntax  Table  to 
find  the  next  terminal,  resetting  the  left  margin  for  the 
comment  line  as  directed  by  every  Indent  and  Exdent  that  it 
encountered  on  the  way.  This  ensured  that  the  comment  would 
be  listed  with  the  same  left  margin  as  the  next  terminal. 
The  logic  for  this  was  very  complex,  essentially  duplicating 
the  Parser's  table  walking  code,  so  the  strategy  was 
changed.  Now  the  Parser  invokes  the  Scanner  only  when  it 
needs  a  token  to  match  to  the  tables.  This  means  that  the 
Indents  and  Exdents  that  we  used  to  look  ahead  for,  have 
already  been  processed  by  the  main  Parser  loop,  and  we  can 
list  the  comment  at  the  current  indentation. 
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6.6  Weaknesses  of  the  Lister 


Some  problems  arise  when  trying  to  define  paragraphing 
rules  based  on  the  Lister  that  we  have  implemented.  Some  of 
4:hese  are  due  to  the  lack  of  look-ahead  in  the  Parser.  The 
Lister  that  we  have  described  does  formatting  only  when  it 
is  invoked  by  the  Parser  in  response  to  a  paragraphing 
command  in  the  Syntax  Tables.  The  Lister  can  not  make 
decisions  based  on  the  token  that  it  is  about  to  handle. 

This  leads  to  some  problems,  in  that  the  Lister  must  be 
ready  to  handle  a  token  when  it  is  being  handled  by  the 
Parser.  It  cannot  examine  the  token  and  then  make  changes  to 
its  own  state  (for  example  change  the  left  hand  margin  to 
exdent  some  special  ^oken) .  Here  are  some  examples  of  the 
kinds  of  problem  that  this  causes. 

Consider  the  following  chart  for  a  ^L/I  IF  statement: 


— -*($) — Stt^-^evtt) - K?) — 


g_H 


'his  char4-  would  aive  the  followina  format: 


IF  expression  THEM 
statement  1 

ELSE 

statement! 

From  this  chart  we  can  deduce  the  following  points: 
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1)  the  New  Logical  Line  command  to  put  the  next 

statement  on  a  new  line  must  not  be  included  in  the 
rule  for  "statement1 2 3 4 5’,  as  we  must  be  able  to  see  the 
Exdent  after  the  statement  in  the  THEN  part  before 
seeing  the  following  New  Logical  Line.  That  is, 
the  ELSE  part  is  exdented,  and  we  must  be  able  to 
reset  the  margin  before  advancing  to  the  line  that 
it  will  be  listed  on. 

2)  an  IF  statement  with  no  ELSE  part  must  end  with  a  New 

Logical  Line.  If  the  ELSE  were  present  it  must  go 
on  a  new  line,  so  we  have  to  advance  to  a  new  line 
before  we  know  whether  ELSE  is  coming  or  not. 

So,  statements  must  not  end  with  New  Logical  Line,  but  we 
have  one  type  of  statement,  the  IF... THEN  with  no  ELSE,  that 
does  end  with  New  Logical  Line.  This  means  that  if  we  have 

IF  expressionl  THEN 

IF  expression2  THEN 
statement 1 

statement2 

the  internal  IF... THEN  will  generate  a  new  line,  and  the 
containing  IF... THEN  will  generate  an  exdent  followed  by  a 
new  line.  This  will  cause  printing  of  an  extra  blank  line: 

1  IF  expressionl  THEN 

2  IF  expression2  THEN 

3  statementl 

4 

5  statement2 

A  similar  problem  that  we  are  able  to  handle  results  from 
the  input : 
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IF  expression  THEN 
statement  1 
statement2 

Since  the  rule  for  statement  should  not  have  a  New  Logical 
Line,  by  the  argument  in  the  previous  paragraph,  then  a 
statement  list  must  consist  of  a  number  of  repetitions  of 
the  sequence  "statement.  New  Logical  Line".  Accordingly,  in 
this  example,  we  would  have  a  new  line  generated  by  the 
IF... THEN,  and  immediately  following  it,  a  new  line 
generated  from  the  statement  list  structure.  This  is 
handled  by  ignoring  invocations  of  New  Logical  Line  if  the 
last  Lister  function  was  also  New  Logical  Line. 

Another  problem  concerns  the  placement  of  END 
statements.  Suppose  it  was  desired  to  have  the  format 


DO  clause; 

{statement} 

END; 

where  the  END  is  at  the  same  level  as  the  corresponding  DO. 
Unfortunately  we  can  not  know  in  advance  that  the  END  is 
coming.  After  any  statement  in  the  body  of  the  group,  we 
may  get  either  another  statement  or  the  END.  If  we  get 

another  statement,  then  it  should  be  listed  at  the  same 

indentation  as  the  body  of  the  group.  If  we  get  an  END  we 

would  like  to  list  it  at  the  same  level  as  the  group  head. 

But  with  no  look-ahead,  we  cannot.  We  are  forced  to  accept 
the  format 

DO  clause; 

{statement} 

END  ; 
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with  the  Exdent  following  the  recognition  of  END. 


With  a  more  sophisticated  set  of  commands,  these 
problems  could  be  overcome.  A  more  complex  Lister  could 
have  been  built  to  handle  these  situations,  but  for  the 
purposes  of  our  original  project,  it  was  not  necessary. 


6.7  Example 


The  following  example  shows  how  the  Lister  prints 
programs  and  error  messages.  This  example  is  from  the 
language  of  Section  2.3.  Note  that  /*???*/  is  used  to  flag 
syntax  errors.  Consider  the  following  input: 


DO  WHILE  (I)  ; J=1 ; 

K=2  AND  GARBAGE; 

DO  WHILE  (THE_VERY_VERY_VERY_LONG_IDENTIFIER)  ; 

J=J  - 1 ; END ; 

END; 

This  is  listed  in  the  following  way: 

1  DO  WHILE  (I)  ; 

2  J=1 ; 

3  K=2  /*???*/  AND  GARBAGE; 

****  SYNTAX  ERROR  IN  PREVIOUS  LINE.  LINE  IS  REPLACED  BY: 

3  K=2 ; 

4  DO  WHILE (THE_VERY_VERY_VERY_LONG_IDENTIF 

****  IDENTIFIER  CHOPPED  TO  31  CHARACTERS 

)  ; 

5  J= J- 1 ; 

6  END ; 

7  END; 
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With  the  discussion  of  the  Lister  in  this  chapter,  we 
have  now  presented  all  of  the  Scanner/Parser/Lister.  We 
have  also  presented,  in  Chapter  3,  the  notation  used  for 
describing  languages.  The  next  chapter  tells  how  to  use  the 
Syntax  Language  and  the  Table  Generator  to  produce  a 
Scanner/Parser/Lister  for  a  programming  language. 
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CHAPTER  7 


USE  _  0  F  _  T  H  E_T  AB  L  E  _  G  E  N  E  RAT  0  R 


In  Chapter  3  we  presented  the  Syntax  Language,  which  is 
equivalent  to  the  syntax  charts  of  Chapter  2.  The  Syntax 
Language  was  developed  to  be  read  by  a  Table  Generator  which 
would  produce  tables  for  different  languages.  These  tables 
are  used  in  the  Scanner/Parser/Lister  described  in  Chapters 
4,  5,  and  6.  In  this  chapter  we  give  details  for  using  the 
Table  Generator  in  producing  a  parser  for  a  specific 
language.  Since  the  tables  are  produced  in  SUE  System 
Language  code,  some  familiarity  with  the  basics  of  SUE  would 
be  helpful  in  understanding  this  chapter. 

There  is  some  information  that  the  Table  Generator  must 
know,  which  is  not  explicitly  expressed  in  the  charts.  For 
example,  it  must  know  the  list  of  line  terminaxors  so  it  can 
pass  them  to  the  Parser  in  the  form  of  tables.  In  this 
chapter  we  will  describe  all  the  information  xhat  must  be 
input  to  the  Table  Generator,  and  give  the  format  for  it. 
As  well,  we  will  discuss  the  output  produced  by  the  Table 
Generator . 
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7.1  Language-specific  Information  in  the  Parser 

There  are  two  types  of  data  in  the 
Scanner/Parser/Lister.  The  first  type  is  data  that 
describes  the  language  being  processed.  The  prime  example 
of  this  type  of  data  is  the  Syntax  Table  which  defines  the 
syntax  of  the  programming  language.  This  type  of  data  is 
called  language-specific.  The  second  type  of  data  is  that 
concerning  a  particular  program  that  is  being  compiled. 
This  data  records  the  current  state  of  the 
Scanner/Parser/Lister  at  any  time.  Examples  are  a  counter 
for  the  number  of  lines  printed,  or  a  pointer  into  the 
Syntax  Table.  This  type  of  data  is  called  program- specif ic. 

The  Table  Generator  must  provide  precisely  the  language- 
specific  information  to  the  Parser.  This  means  that  the 
input  to  the  Table  Generator  must  be  sufficient  to  allow  it 
to  deduce  all  of  this  information.  In  the  next  section  we 
show  how  the  Table  Generator  is  used  as  part  of  a  translator 
writing  system,  and  in  the  remainder  of  the  chapter  we  give 
the  details  of  the  input  and  output. 

7.2  Structure  and  Use  of  the  Table  Generator 

The  Table  Generator  itself  is  actually  a  compiler.  It 
translates  from  the  Syntax  Language  of  Chapter  3  to  the  Sue 
System  Language.  The  target  Sue  System  Language  code  is 
used  to  initialize  tables  in  the  Scanner/Parser/Lister. 
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Because  the  lable  Generator  is  a  compiler,  it  was 
decided  to  use  the  Scanner/Parser/Lister  in  its 
implementation.  With  appropriate  tables  describing  the 
Syntax  Language,  the  Parser  is  used  as  the  recognizer  for 
the  Table  Generator.  This  means  that  the  Table  Generator 
input  is  paragraphed  and  syntax-repaired  by  the  strategies 
described  in  earlier  chapters.  A  bootstrap  version  was 
produced  by  hand-coding  the  Parser  tables  (one  line  of  code 
per  table  entry)  for  recognition  of  the  Syntax  Language. 
The  automatic  generation  of  the  tables  was  done  by  a  second 
phase,  analogous  to  the  code  generator  of  a  conventional 
compiler.  Figure  7.1  illustrates  the  construction  of  the 
bootstrap  version  of  the  Table  Generator. 


r  i  I 

|Hand-coded|  | 

|  Tables  1  | 

i _ - — - j  | 

I  I 

I  1 

V  I 

/  l  r  i  I  r  n  /  l 

/  Input  |  |Scan/Parse|  1  |  Table  |  /Output| 

| Description | --> |  /List  |-->| Generator f-->| Tables| 

L - J  L - - - J  |  L - J  I - - --J 

Analysis  Generation 

7.1  Bootstrap  Version  of  the  Table  Generator 


With  the  bootstrap  version  running,  a  description  of  the 
Syntax  Language  in  terms  of  itself  was  input  to  the  system, 
and  the  Table  Generator  produced  its  own  Syntax  Tables. 
These  tables  were  then  substituted  for  the  hand-coded 
tables. 
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Analysis 


Generation 


7.2  Generated  Version 


The  Table  Generator  and  the  Scanner/Parser /Lis ter  can 
now  be  used  in  the  implementation  of  other  compilers.  A 
programming  language  is  described  in  the  Syntax  Language, 
and  this  description  is  input  to  the  Table  Generator.  The 
output  from  the  Table  Generator  is  compiled  with  the 
language-independent  parts  of  the  Scanner/Parser/Lister  to 
produce  a  recognizer  for  the  particular  language.  The 
remainder  of  this  chapter  describes  the  specific  input  to, 
and  output  from  ,  the  Table  Generator. 


7.3  Input  to  the  Table  Generator 

As  has  been  mentioned,  the  Table  Generator  needs  more 
information  than  that  described  in  Chapter  3.  In  this 
section  we  will  give  a  brief  reason  for  the  inclusion  of 
each  different  piece  of  data,  and  how  it  is  presented. 

The  collection  of  all  the  data  fed  to  the  Table 
Generator  is  described  in  a  set  of  syntax  charts  in  Appendix 

D. 
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Semantic  Identifiers 


The  semantic  identifiers  in  the  syntax  charts  invoke 
actions  by  the  Code  Generator.  The  Table  Generator 
associates  a  unique  integer  with  each  of  these  identifiers. 
When  the  Parser  encounters  one  of  these  integers  in  its 
Syntax  Table,  it  sends  the  integer  to  the  Code  Generator. 
The  Code  Generator  can  have  a  large  case  statement  which 
selects  the  action  to  be  performed.  The  labels  on  the  cases 
are  the  semantic  identifiers  which  correspond  to  these 
integers.  For  example,  if  a  language  description  contains 
the  semantic  identifier  Perform  Addition,  then  in  building 
the  Code  Generator  for  the  compiler  for  the  language,  we 
would  want  to  use  the  label  Perform  Addition,  as  follows: 

case  Semantic  Numbers  tag  Input  From  Parser; 

•  •  • 

Perform  Addit ion : Emit (Add  Operation); 

•  •  • 

end 

This  is  accomplished  by  including  a  "declaration"  of  all 
semantic  identifiers  as  the  first  component  of  the  input  to 
the  Table  Generator.  The  keyword  SEMANTIC_#  is  written, 
followed  by  a  list  of  semantic  identifiers,  each  enclosed  in 
quotes,  and  separated  by  blanks.  Only  these  semantic 
identifiers  can  be  used  in  the  paragraphs  that  will  describe 
the  language.  Consecutively  placed  identifiers  are  assigned 
consecutive  integers.  The  Code  Generator  can  take  advantage 
of  this  ordering.  Here  is  an  example  of  specifying  semantic 
identifiers : 
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SEMANTIC,* 

*  ADD  ® 

' SUBTRACT' 

'OR' 

'AND* 


Line  Terminators 

In  Chapter  5  we  discussed  the  error  repair  strategy  of 
the  Parser,  and  pointed  out  that  some  tokens  of  the  language 
being  compiled  are  designated  as  line  terminators.  The  user 
of  the  Table  Generator  must  give  the  list  of  line 
terminators  for  his  language.  This  is  done  by  including  as 
the  second  component  of  the  input  the  keyword  TERMINATE, 
followed  by  a  list  of  keywords,  each  enclosed  in  quotes,  and 
separated  by  blanks.  Here  is  an  example  of  specifying  line 
terminators : 

TERMINATE 

'  DO' 

'END* 

Keywords  which  are  not  line  terminators  are  picked  up  from 
the  productions.  (Program  terminators  are  defined  by  the 
system  and  are  not  input  by  the  user.) 

Predefined  Identifiers 


As  well  as  having  keywords  which  are  part  of  the  syntax 
of  the  language,  many  systems  give  a  meaning  to  some 


ident if iers 

that 

can  be  used  in  programs. 

The 

most  common 

use  of  these 

is 

in  naming  builtin 

(or 

system- supplied) 

functions. 

An 

example  is  SIN 

in 

PL/I. 

A  bit  of 
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implementation  detail  will  explain  the  need  for  sending 
these  to  the  Parser.  In  our  Parser,  the  source  text  for 
identifiers  is  kept  in  the  interface  from  the  Parser  to  the 
Code  Generator,  on  the  Parser’s  side.  A  unique  integer  is 
passed  to  the  Code  Generator  for  each  different  identifier 
encountered  in  the  source  program.  The  Code  Generator  never 
sees  the  string  representation  of  an  identifier.  This  means 
that  if  there  are  to  be  identifiers  that  have  special 
meaning,  the  Parser  must  know  about  them  so  that  it  can 
initialize  its  table.  The  Code  Generator  must  know  what 
integers  are  to  be  assigned  to  these  identifiers,  so  that  it 

can  recognize  them  when  it  receives  them  on  input.  If  the 

\ 

list  of  identifiers  that  can  have  special  meanings  is  non¬ 
empty,  it  is  input  to  the  Table  Generator  by  placing  the 
keyword  PREDEFINED,  followed  by  the  identifiers,  each 
enclosed  in  quotes,  and  separated  by  blanks,  as  the  third 
component  of  the  input.  For  example,  we  could  have 


PREDEFINED 

'SIN* 

*cos » 

• SQRT' 


The  final  component 
for  the  user’s  language  as 
(See  Chapters  2  and  3.) 


of  the  input  is 
expressed  in 


the  syntax  charts 
Syntax  Language. 
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7.4  Output  to  the  Parser 


The  output  from  the 
System  Language  compilation 
t  he  Scanner/Parser/Lister . 
following  figure. 


Table  Generator  is  a  set  of  SUF 
blocks  which  are  compiled  with 
These  blocks  are  shown  in  the 


\ _ / 


Ive-koh'ifce 

Prccl  earned 

ToJkU  j 

XcU-nt-diers 

Descriptions  of  these  blocks  will  now  be  given. 

The  semantic  identifiers  in  the  programming  language 
description  must  be  associated  with  integers.  This  is  done 
in  rhe  SUF  System  Language  context  block  named  Semantic 
lumbers.  This  is  a  list  of  constant  definitions  giving  a 
numeric  value  to  each  semantic  identifier,  and  defining  the 
limit  of  these  identifiers.  This  block  is  used  in  compiling 
both  the  Parser  and  the  Code  Generator. 

The  Table  Generator  outputs  two  blocks  that  contain 
information  on  keywords.  The  Parser  has  a  table,  called 
Keyword  Table,  which  contains  the  text  of  all  keywords  in 
the  language  being  compiled.  This  table  must  be 
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initialized,  and  this  is  done  by  the  program  block  named 
Initialize  Keyword  Values.  The  second  use  of  the  keyword 
information  is  in  establishing  the  integer  values  of  the 
tokens  for  the  Parser.  This  is  done  in  the  context  block 
Keyword  Values,  which  contains  constants  used  in  the 
mismatch  logic  of  the  Parser.  These  two  blocks  are  used 
only  in  compiling  the  Scanner/Parser/Lister. 

Another  block  that  is  used  by  both  the 
Scanner/Parser/Lister  and  the  Code  Generator  is  the  context 
block  Predefined  Identifiers.  Each  such  identifier  is 
associated  with  an  integer  via  a  constant  definition  in  this 
block . 

The  final  output  from  the  Table  Generator  is  in  the  data 
and  program  blocks  for  the  procedure  Initialize  Syntax 
Tables.  This  block  is  used  in  the  Scanner/Parser/Lister  to 
set  up  its  Syntax  Table. 

7.5  Example 

Here  is  the  Table  Generator's  input  and  output  for  the 
example  presented  in  Section  2.7. 


e 
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*♦*******'('***«**#*<'<'*******++* ’f'*************************************** 

*  PROJECT  EO  TABLE  GENERATOR  VERSION  0.1  * 

*  AN  EXAMPLE  LANGUAGE  * 
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CHAPTER  8 


CONCLUSION 


This  thesis  has  discussed  a  practical  syntax-repairing 
and  paragraphing  parser  and  the  notation  used  to  describe 
languages  it  can  process.  A  table  generator  which  reads 
this  notation  and  produces  tables  to  drive  the  parser  for 
different  languages  was  also  presented.  In  this  chapter  we 
report  on  the  current  status  and  use  of  these  programs,  and 
indicate  further  work  to  be  done. 

8.1  Generated  Compilers  in  Use 

There  are  currently  three  compilers  that  are  based  on 
the  parser  we  have  described.  We  will  briefly  describe  each 
of  these. 

8.1.1  SP/k 

The  Parser  was  originally  built  to  handle  SP/k.  When  it 
was  first  used,  the  tables  were  built  by  hand,  in  the  same 
way  that  the  Table  Generator  bootstrap  version  was  produced. 
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Subsequently,  when  the  Table  Generator  had  been  built,  a 
description  of  SP/k  in  terms  of  the  Syntax  Language  was 
produced,  and  the  Generator  output  the  corresponding  tables. 
The  hand-coded  tables  were  then  discarded. 

The  SP/k  compiler  is  by  far  the  most  heavily  used  of  the 
three  compilers  produced  using  our  system.  It  has  been  used 
in  high  school  courses  in  Toronto,  as  well  as  in 
introductory  computer  science  courses  at  the  University  of 
Toronto.  It  has  been  in  use  on  the  High  Speed  Job  Stream  at 
the  University  of  Toronto  Computer  Centre  for  several 
months.  During  fall  1974,  the  monthly  traffic  through  the 
SP/k  compiler  has  exceeded  50000  jobs.  The  compiler  has 
been  extremely  reliable;  no  verified  bugs  have  been  detected 
at  UTCC  and  no  compiler  crashes  have  occurred. 

The  SP/k  compiler  is  expected  to  be  used  in  a  high 
school  on  the  PDP/11  minicomputer  early  in  1975. 


8.1.2  Table  Generator 


The  Table  Genera 
Syntax  Language 
S  cann  er/P arse r /Lister . 
Table  Generator  has  be 
Thus  far,  the  Table 
three  "programs".  Thes 
SP/k,  2) the  syntax  for 
for  the  language  calle 


tor  is  a  compiler  that  translates  the 
to  tables  to  drive  the 

As  was  explained  in  section  7.2,  the 
en  used  to  generate  its  own  tables. 

Generator  has  been  used  to  translate 
e  programs  were:  1)the  syntax  for 

the  Syntax  Language,  and  3) the  syntax 
a  Zed . 
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8.1.3  The  Language  Called  Zed 


The  Table  Generator  has  been  used  to  produce  syntax 
tables  for  a  small  implementation  language  called  Zed.  This 
language  was  designed  for  use  in  an  undergraduate  course  in 
compiler  writing.  Our  system  was  used  to  produce  the  parser 
component  of  a  complete  working  compiler  for  Zed;  the 
students  had  to  replace  the  Code  Generator  component  with 
their  own  versions. 

Subsequently,  several  undergraduate  projects  have  been 
based  on  Zed.  One  such  is  the  writing  of  a  Zed  compiler  in 
Zed,  which  is  using  our  Parser.  Again,  no  reported 
problems. 

8.2  Pedagogical  Value  of  the  Parser 

One  of  the  aims  of  our  system  was  to  be  useful  in 
teaching  programming,  and  we  must  evaluate  the  Parser  from 
this  point  of  view.  The  SP/k  compiler  is  the  only  one  of 
the  three  based  on  our  Parser  that  has  been  used  by  people 
without  computing  background.  A  brief  survey  was  made  of  a 
set  of  programs  submitted  to  the  SP/k  Parser.  These 
programs  were  written  by  a  class  learning  to  program  in 
SP/k.  The  error  repair  performed  by  the  Parser  was  compared 
to  the  human  reader*s  subjective  judgment  of  what  the  user 
intended.  When  these  were  equal,  the  repair  was  considered 
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to  be  a  correction.  From  the  set  of  83  programs  with  102 
syntax  errors,  the  following  statistics  were  gathered: 


errors  corrections  symptoms 


33 

13 

9 

8 

39 


30 

13 

8 

6 

15 


missing/unbalanced  parentheses 
missing  PROCEDURE  or  END 
missing  semi-colon 
misspelled  keyword 
miscellaneou  s 


102  72 

These  figures  indicate  that  the  error  repair  strategy  is 
very  effective.  Reaction  of  students  and  professors  to  both 
the  syntax  repair  strategy  and  paragraphing  have  been 
favourable,  for  the  most  part. 


8.3  Enhancements  to  the  System 

There  are  several  possible  extensions  to  the  work  we 
have  reported  on  in  this  thesis. 

A  number  of  improvements  could  be  made  to  the 
Scanner/Parser/Lister.  We  can  currently  handle  LL  (1) 
grammars  with  only  terminals  at  the  choice  nodes.  This 
restriction  means  that  we  sometimes  have  to  rewrite  grammars 
to  explicitly  list  all  successors  of  choices.  The  Parser 
could  be  modified  to  relax  this  restriction,  and  allow  non¬ 
terminals  to  appear  at  choices.  This  would  give  us  the 
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power  to  handle  all  LL  (1) grammars .  The  Lister  could  be  made 
more  sophisticated  in  order  to  overcome  the  problems 
detailed  in  Section  6.6,  although  this  is  not  critical;  our 
experience  with  SP/k  and  Zed  shows  that  in  its  present  form 
it  can  be  used  for  some  languages. 

The  error  repair  strategy  of  the  Parser  should  be 
investigated  further.  Currently,  we  do  not  treat  errors 
which  lead  to  a  locally  correct  but  globally  wrong  construct 
in  a  very  intelligent  way.  An  example  is 

IF  X=2  THEN; 

Y=4  ; 

ELSE 

Y=2; 

Here  we  would  lose  the  THEN  clause  because  of  the  semi-colon 
following  THEN.  Con seguent ly ,  the  ELSE  clause  would 
generate  a  syntax  error  message.  Perhaps  some  relatively 
simple  modification  of  our  basic  strategy  could  be  found  to 
handle  this  class  of  error.  It  may  be  possible  to  extend 
the  hierarchy  of  tokens  (regular  tokens,  line  terminators, 
program  terminators)  to  help  in  this  area. 

The  Table  Generator  could  do  a  more  extensive  analysis. 
Currently,  it  will  produce  tables  for  all  languages  that  the 
Parser  can  handle,  but  also  for  some  that  will  cause 
infinite  loops,  or  inefficiency  in  the  Parser  at  run  time. 
The  analysis  by  the  Table  Generator  could  check  the  input 
grammar  for  such  things  as  local  ambiguity,  left  recursion, 
etc.  [Fong]  gives  a  design  for  a  system  to  analyze  LL(1) 
grammars,  ensuring  these  properties,  and  some  of  his  work 
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could  be  implemented  in  the  Table  Generator, 
especially  true  if  the  system  is  to  be  used  by  studen 


Finally,  the  Table  Generator  and 

Scanner/Parser/Lister  should  be  used  in  producin 
compilers.  Experience  with  other  languages  could  i 
areas  of  further  study  in  the  error  repair  technig 
development  of  paragraphing  rules. 


8.4  Summary 


More  experience  with  other  programming  language 
contribute  to  a  better  understanding  of  the  shortcomi 
the  particular  repair  and  paragraphing  strategies 
used.  This  could  suggest  possible  improvements 
techniques  described  in  this  thesis.  Howeve 
techniques  we  have  presented  have  been  successful  f 
languages  we  have  implemented,  and  we  feel  that  the 
be  applied  in  the  present  form  to  other  similar  langu 


This  is 
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Appendix  A 


The  SP/k  Subsets  of  PL/I 


This  Appendix  contains  the  syntax  charts  and  the  Syntax 
Language  description  for  SP/k. 


94 


rr 


ProyaTt-T) — — >  TJ-e^+iA 


*  ! 


PROCE-UtAfe 


->  opno«js  — i  (_  — i  mAi kt 


3  -* 


•  i 

•  i 


Go.dL 
JTeljucWre* 


Proctdurt.  1 

OocAm  ) 


C Declare  (  Detlara.+;e'v\'N\ 

£cd,  ) - \  cu-.» 


-<?)—) 


-471— 


<57 


J 


/t)e  c  laro.+.'o'n  \ _ [  7~\  f  Va-'-' 

v  tu-.+  y^~tAJ  v  d 


able 
Declare. 


s4on*txiol 


S)  ?  |T]  CTT)  1 1' 


•> 

j 


-95- 


rieujcirrai^ 


riftioSea.\orf* 


£  ^  boMTidl 

Qogy>d^) — X? - *  l  ~7->(^~Go  ur\  3  yr*?  - H 


l  b*u.-ndl 


or\ 


•n  tb&uvioL  £',«Abe*4v1<i 


V  , 

r 

U 

t 

n 

•ne^covi 


(5Z> 


k — > 


p  ive_t> 


AjeA+ypft 

|l 


F-loat 


■f  l©ok.++«jpe 

’  II 


BIT 


CHARACTER. 


*  ( 


bi‘H«v^W  bi'^jpc, 


(  - >  X-A+cjer 


)  -T'h 


>? - ) 


X^+e.Qe'* 


* 


V 

13 

VARVWC- 

ck^+vpe. 


c.w«tkct\ficifiv\ 


-96- 


c 


Proc  etAnye. 

t)«c.'o  «*.+;  arv\ 


evxJticope- 


( y 

V  Tv  pe-  _y 


-X> 


FIXED 


FLOAT 


b-.+taxgH,  -Pt^oc 


v_ 


Q  IT 


*9 - > 


c 


X'n+eae^ 


* ;  — i 


) 


bi+proC 


CAAAAC-TtA 


(_  5  I'A+tAHj-  *  ) 


*  VAftVl^C- 


-97- 


$+D4ew,e.'«+ 
(^Sfg4aT»ve-»T4^  '  ><])  * 


?  Par 


as.r 


*  IF 


DO 


v_ 


CALL 


RtTUftO 


Y_ 

^  S4o4-CVAftlr\-\y 


\  I 

A  S-fe>4-e^*w'V  I 


/  Ft  \ 

A  y 


< 


■&» 

S4o^4a>nc^4 


< 


Co*.  I  i 

S4o^+evvve^+ 


I' 


Y  £e.-twn  ^  I 

~\  S4a4  &  J  I  ' 


v — > 


,  clev\"t'4-  e< 


cacl 

Hole.  wt.  ■Ac./' 


K;,ircpWt\  i, 
S4o^en+/-|' 


|  k  Q  r/^LU 


IkSots 


i  a^vv* 

V  Sf  a.~  e.r^  e^4 


} 


X? - ? 

c 

-K  st^>-’ 

)  i 

€ 

_ _ ✓ 

ExpteSS'^)  ’  ? 


s4o4c»At’i.+' 


-98- 


colledLpfoc. 


p  '■©c.Call 


y 

V  Sd-a.fewt^'t  v 


-*?■ 


J.oLm4i-fter 


Qoci. 

Xd  «.  V\ T'-Cf  tf 


fercwa+cr  ^ 

) 

— 

Il 

\  L;*+  *1 

5  ? 

• 

f€.+n 


i  ■PKeaol 


or 


or 

bar 


■^QuI*W^) 


11H 


(OumW^) - ^(^aoUorj)-^ - - <^) 

3D — fi' 


-100- 


-101- 


£-.kI;+ 


-102- 


********************************************************************** 


00  » 


2 

c 

a. 

< 

Cl 


K  2 


or 

X 


< 

LU 

oo 


< 

CO 

1  x  - 

2  O 

• 

© 

o 

t- 

0C 

co 

►-4 

m 

or 

k  y- 

es 

UJ 

Uj 

UJ 

C/> 

LU 

0 

• 

0 

UJ 

CO 

UJ 

UJ 

O  oo 

»-  t- 

o 

cr 

a 

•  o 

* 

UJ 

m 

•— 

t- 

-J 

-J 

• 

X 

LX 

CL 

t- 

2  <- 

CO  < 

<1 

G 

X 

lu  C 

a 

2 

LU 

_ i 

»— * 

m 

< 

< 

< 

2 

• 

t— 

X 

>- 

LJ 

UJ  -J 

• 

OP 

2  a 

LU  C  O 

cr  cr 

Uj 

< 

— ) 

i 

-J 

-J 

OL 

cr 

r> 

• 

< 

CO 

» 

2 

►— 

• 

0 

< 

-  1 

Cj 

c 

m 

C  <r 

X 

LU 

UJ 

x  a- 

*- 

2 

co 

2 

i  < 

LU 

a 

0 

c 

I 

• 

CO 

UJ 

2 

1 

U! 

U_ 

cr 

|O0 

0  0 

2 

2 

0 

CL 

O  -I 

X 

lo 

u 

O  1 

LU 

«*> 

o 

1 

< 

CJ 

• 

cr 

cr 

t- 

b~ 

« 

— 1 

UJ 

< 

t- 

2 

» 

0 

2 

x 

cr 

LU 

cr 

0 

CL 

CL 

<r 

cr  C 

o  c 

X 

X 

> 

< 

ID 

»- 

LU 

c 

c 

uj  a 

s 

U> 

LU 

UJ 

t— I 

*— 

LJ 

UJ 

lu 

i— . 

►— « 

LU 

< 

i  x 

1  < 

V 

t- 

►— « 

— i 

<x 

cr 

Uj 

LU 

>- 

V 

X 

Uj  2 

2  2 

o 

o 

< 

-J 

Uj  LU 

o 

a 

cr 

CL 

LJ  UJ 

< 

a 

a 

a 

cr 

or 

-J 

b- 

h- 

-J 

-J 

»- 

=3 

cr 

o  or 

I 

53 

-I  o 

2 

CL 

a. 

<1 

i- 

CL 

t- 

►- 

LJ 

t-  X 

X  X 

CD 

tr 

cr 

<X 

>  C 

2 

G 

a 

Cl 

o  t- 

Cr 

c 

o 

X 

< 

00 

CD 

u 

*— * 

i 

1  <r 

o 

OB 

UJ 

u- 

UJ 

b- 

UJ 

CL 

< 

1 

i 

1  c. 

O 

> 

i 

1 

i  <_>  a 

c  o 

1 

1 

cr 

o 

•—  |  LU 

Q 

i 

1 

or  «_? 

<1 

UJ 

o  c 

> 

LU 

<3 

<X 

—i 

c 

2 

LX 

b- 

It- 

1  c 

►-« 

cr 

> 

>- 

»- 

i  <i 

b~ 

o 

<L  CC 

cc  c c 

cr 

cr 

0 

<5 

CO 

t-  h- 

=J 

UI 

c 

b~ 

a  < 

CL 

CO 

CO 

UJ 

1  QC 

*— • 

a 

i  c 

IX 

UJ 

i  < 

<>. 

CO 

< 

CO 

»— • 

t- 

t- 

0 

a 

Cl 

0 

• 

X 

b- 

cr 

1  <a 

UJ 

OJ 

cr  1 

1  1 

LU 

Uj 

CL 

i 

1  <  CL 

1  O 

LU 

<a 

1  CL 

1 

1 

1  o 

C 

a 

cr 

< 

h- 

c 

X 

»- 

X  LU 

CO 

LU 

on 

> 

-J 

CD 

c. 

<. 

< 

o 

0 

o 

U- 

< 

H- 

O  x 

LU 

<  a 

O  LU 

a 

< 

3 

J  o  < 

t- 

D 

X 

c 

t-  < 

3 

o 

3 

c 

2 

X 

< 

i 

*— • 

-J 

c 

c 

O 

(X 

UJ 

cr 

LU 

*— » 

G 

3 

Cl 

2 

2 

2 

cr 

o 

*— • 

Uj 

i 

*-4 

-J 

i 

X  2 

2  2 

a 

o 

►— 

UJ 

LU 

LU  K 

*— * 

cr 

•— 

-J 

—  i 

LU 

2 

LU 

cr 

LU 

LU 

> 

u> 

CD 

LL 

LU 

o 

UJ 

O 

-J 

O 

u 

1 

<1 

c 

| 

CO 

<1 

—/ 

X 

<1 

1 

< 

O  2: 

cr 

-J 

o 

CC 

Lu 

1 

< 

LL 

UJ 

o  u 

1  1 
<1  <1 

Lu  G 

W| 

<1 

-J 

CO 

2  CO 

1  1 
<1  <1 

CD' 

| 

a 

Lu 

| 

LX 

1 

cr.  (_i 

1  1 
<i  « 

2 

1 

LU 

1 

2 

11 

CL 

1 

<1 

<1 

I 

<X 

< 

<1 

< 

<L 

<1 

< 

<1 

< 

< 

<1 

< 

<a 

<. 

< 

<x 

< 

<1 

< 

1 

<1 

1 

< 

I  » 
<  <. 

1 

< 

<1 

< 

1 

<L 

< 

< 

< 

< 

•a 

<1 

< 

Hf\ifo^ir>or'«0'0'<(M(o^in<Ch-»0'Or4(Nj(o<}-ir<of'ffi(j'Ot^cg(Oj-if\«oh-®o'OH(\j(o<tin^Neo(>Or-iM 


-103- 


ZOK 
tu  •—  a. 


2 

X 

in 

O 

UJ 

in 

CL 

«— > 

m 

(- 

LU 

o 

in 

2 

< 

a 

m 

in 

O 

t— 

CL 

CD 

e» 

• 

• 

LL 

n* 

O') 

X 

*"**■ 

LL 

m 

LL 

2 

04 

» 

in 

1 

LL 

in 

-J 

LL 

m 

to 

LU 

Q- 

2 

• 

m 

►- 

1 

1 

• 

CC 

-J 

IL 

«J 

X 

X 

O- 

h- 

m 

m 

u. 

6» 

2 

►— 

O 

» 

d 

n- 

— J 

UJ 

»— 

m 

» 

UJ 

X 

2 

X 

CL 

01 

Uj 

a 

UJ 

CL 

2 

• 

O 

c 

X 

1 

l 

UJ 

o 

m 

1 

»— 

UJ 

m 

n- 

Cl 

O 

X 

►— 

d 

c 

• 

UJ 

< 

CL 

X 

X 

LL 

LL 

in 

2 

2 

LL 

2 

h- 

X 

X 

< 

2 

Ql 

X 

c 

> 

X 

LL 

< 

1 

n« 

►— < 

-j 

UI 

CC 

CL 

CL 

UJ 

»— * 

in 

LU 

o. 

in 

o 

CJ 

1 

1 

CL 

LJ 

X 

> 

K 

1 

1 

1 

UJ 

X 

it- 

1 

1 

1 

1 

1*- 

in 

h» 

c_ 

1 

1 

» 

1 

1  Lf- 

LU 

o 

C J 

1 

1 V-  t- 

LL 

d 

K** 

»— 

K 

H* 

h- 

in 

CC 

Ll 

2 

c 

> 

o 

c 

o 

c 

c 

CL 

2 

2 

U. 

LL 

LL 

LL 

CC 

LU 

Uj 

X 

— "s 

X 

X 

LL 

LU 

1 

2 

1 

CL 

I 

l. 

| 

c. 

1 

o 

i 

CJ 

1 

Cl 

I 

C. 

I 

Uj 

1 

Lu 

a 

1 

1 

CL 

| 

CL 

a 

CL 

I 

in 

i 

O 

a 

C 

i 

CL 

a 

c. 

1 

CL 

i 

d 

m 

• 

— J 

i 

1 

d 

1 

d 

1 

<1 

d 

1 

d 

1 

<1 

1 

< 

I 

d 

1 

d 

1 

< 

1 

<1 

1 

d 

1 

< 

1 

d 

1 

d 

1 

< 

i 

< 

l 

< 

1 

< 

1 

d 

1 

<1 

i 

< 

d 

1 

< 

1 

< 

-  £X 
>  < 
<  _j 

CL  C 
CL  O 
<t  «✓> 

I  I 
o  O 
z  z 

«3  C 
I  X 
I  I 
t-  h- 

U-  LL 


— I  _J 

I  i 
<1  < 


fO  IT 

in  ip  m 


'ONCDC'Or-<(\jpn 

ir  ir  in  in  >0  >c  vC  c 


^  in  >o  n  ®  O'  o 


h  rj  ^  >t  ip  o  N 
h*  h  h  ^  n  h-  n 


00  O' 


m 


LL 

N-« 

Ql 

m 

m 

• 

X 

•mi 

i- 

1 

a 

Q 

-J 

vn 

Uj 

UJ 

UJ 

< 

NM 

CL 

>- 

O 

U 

-J 

X 

LL' 

c 

| 

t 

o 

X 

or 

IL 

m 

UJ 

UJ 

< 

a. 

ot : 

z 

cz 

o 

CL 

1 

X 

N* 

o 

d 

o 

c 

<3 

O 

CL 

N 

Q. 

UJ 

LL 

X 

UJ 

a 

m 

LL) 

N 

•> 

• 

i  _i 

L— 

1  CJ 

1 

z 

(X 

2  Q 

cx 

X 

c 

J 

c 

C- 

Cj 

o 

M 

LU 

m 

< 

» 

• 

cx  u. 

h— 

• 

• 

• 

2 

d 

CL 

2 

a: 

2 

d 

h- 

- J 

-J 

LU 

m 

• 

2 

X  2 

in 

o 

2 

H* 

m 

» 

o 

LL 

LJ 

CL 

LL 

CL 

LL 

X 

d 

L-> 

m 

c 

• 

1- 

LL 

CL 

C  2 

in 

d 

CC 

c 

a 

2 

1 

i 

1 

1 

i 

1 

1 

2 

d 

UJ 

c 

-J 

2 

Uj 

LL 

X 

X 

LU  LL 

LU 

O 

*—< 

c 

»- 

o  o 

X 

<■ 

<1 

< 

d 

d 

<. 

d 

<c 

N* 

In 

o 

Ci 

LL 

LL 

o 

N-» 

CL 

t- 

a.  Uj 

in 

=J 

X 

in 

o 

d 

in 

L 

Ll 

CC 

in 

X 

Uj 

CL 

UJ 

-J 

UJ 

CL 

X 

t- 

Cl 

GC 

o 

CM 

CO 

in 

o 

r*- 

00 

c- 

o 

•—4 

CM 

po 

in 

«c 

P- 

00  O'  o 

CM 

PO 

in 

«o 

h- 

00 

O' 

o 

CM 

CD 

00 

00 

00 

oc 

00 

00 

co 

CC 

oc 

O' 

O' 

O' 

O' 

O' 

O' 

O' 

O' 

O'  O'  o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

H 

H 

r— 4 

N-J 

r— < 

^“4 

rH 

H 

-104- 


G 

oc 

< 


ex 

X 


z 

G 

< 


>■ 

c 

o 

oc 


(X 

=5 

a 

UJ 

u 

o 

oc 

a 


< 

g 


3 

G 

z 


z 

Ll 

Cj 


<3 

m 

< 


< 

2: 


»/i 

z 

o 


O. 

c 


UJ 

OL 

X 

c 

UJ 


o 

z 


I 

w 

z 

< 

of 

CD 

UJ 


t-  Z 

X 

UJ  » 
Z  K 

z 

CM  Uj 

x 

X  U- 

o  *- 
z  < 

<  J- 
00  U) 
CO  i 

t- 

-  z 

Z  UJ 

O  X 

—  z 
*-  e> 
«  — 
OC  CO 

<  co 

_i  < 
UJ  - 


D 

Z 


X 

<_> 

z 

<. 

or 

CO 


X  G 
G  Uj 
Z  00 
«  < 
oc  _j 
cc 


V  CO 

o  — 

O  _J 
CO  I 
I  UJ 
UJ  oc 
or  X 
<1  O 

—1  UJ 

o  g 

LLi  C 

D  a 

-  a. 


UJ 

as 

< 


oc 

UJ 


CO 


g  <r 
_j  - 
«. 

o  < 


o 

X 
m  cj 


LU 

ID 

X 

> 

-  UJ 

UJ  LLi 

►— 

a 

UJ 

2 

0 

LL  </) 

CO  C 

IL 

or 

2 

UJ 

0 

or  *— 

0  — 

»— 4 

a 

O 

cc 

<  :r 

C 

fr- 

• 

UJ 

1 

-j  a 

X 

2 

•—4  •• 

l-J 

Uj 

(D  UJ 

CM  U 

l-J 

UJ  «3 

a  g 

X 

G  - 

Uj  K 

LU 

(_>  1/) 

00 

O  ►** 

< 

OC  —1 

Cl  1 

-  UJ 

• 

a 

»- 

co 

_J  c 

•H 

<  UJ 

-J 

G  G 

1 

C 

ui 

•  oc 

OC 

uj  a 

X  1 

O 

-3  O 

G 

z  z 

t-  G 

1  UJ 

X  O 

UJ  1 

G  QC 

oc  < 

z  a 

1 

c 

m  0 

G  < 

z 

G 

X  G 

O  Ui 

O  | 

OC  CO 

Z  4 

a  •— 

<3  • 

1^ 

a 

<  a 

as  < 

•  UJ 

X 

y~  g 

<3  *- 

Z  co 

•  •  O  1- 

LL!  m 

X 

D  3 

u 

•—  ex 

z 

1  in 

O  X 

<  (- 

CD  C 

a 

z 


z 

oc 

X 

»- 

Uj 

OC 


3 

Uj 

Z 


co 


Z 

UJ 

X 


< 

G 


c 

z 


co 

o 

c 

X 

G  G 
K  UJ 

•  x  in 

K  G  <3 
g  Z  -j 

3  2  uj 

la  z 

Z  D  ■- 
G  t-  G 

*— 1  Qj  | 

co  t~  oc  3 
— ■  <r  uj 
a  uj  Z 
•  <  z 

>-  _J  G  t— 

c  g  g  z 

a 

CO 


I  UJ 

3  C 


UJ  X 

CO  Uj 

O  Z 

o 

I  z 

g  ex  o 

x  z 

-KG 
l-  UJ 
>-  (C  H 

z 

3  UJ  X 

IZ  G 

z  —  z 

O  G  «j 

*-  I  a 

J-  3  cc 

<  UJ 

a  z  U' 

<  z 

j  1- 
C  Z  J 
Uj  UJ  I 
0  0  3 
•  XU' 
U  Z 

G  •*  - 

G  1 

c 

o 


CLJ 

< 


CO  <3 

g  oc 
< 
-  _i 

K  o 

~  UJ 
Z  Ci 
X  I 

I  K 

z  a 
u  o  < 


CO 

o 

O 

X 

g 

J — 

•  x 

UJ  Uj 

a  z 

<t 

-j  z 
o  a 

ii  1  o 

G 


co 


x 

UJ 

z 

z 

a 

x 


co 


co 

Q 

Z 

X 

O 

CO 

I 

a 

z 

UJ 


<  G 

z 


co  Z 

•—  a 

_J  3 

I  K 
co  uj 
O  a 
z 

x  - 

O  ex 
oc  < 
•  -j 
c 
-J  o 
_j  co 

<  I 
G  3 

UJ 

.  z 
>  I 

<  4 

a  • 

a 

<  < 

l 


UJ 

CO 

O 


1  g 

«■ 

UJ 

3  G 

G 

0 

0  a 

G 

CD 

G  CO 

z 

I 

k 

O 

a 

< 

Z  — 

< 

0 

1  - 

z 

< 

-J 

13 

or 

G  G 

G 

G 

<  a 

co 

• 

_i  a 

G 

a 

•  G 

1/0 

a: 

os  > 

^4 

G 

UJ 

G 

X 

K 

►— » 

< 

c  1— 

O 

*—4 

CO 

<  t- 

Z 

fr¬ 

•  0 

X 

1 

LL 

O 

•OK 

G 

<& 

iz) 

a 

G 

c 

►-4 

O 

X 

O 

K 

1 

«  _> 

Z 

U)  t- 

H- 

X 

G 

*—•  Uj 

I/O 

-J 

< 

>  G 

< 

G  | 

2  — « 

G 

Z 

l-J 

*— • 

UJ 

'LL  G 

c  r>  a 

LU  G  X 

3)  1  • 

LU 

2 

X  t- 

00  -  < 

CX  X  G  G  G 

c 

X  G  G 

oc 

:co  a 

<  oc  Z 

-J 

CO  O  H- 

CL  Uj 

2 

ta-4 

ar  <3 

1  0 

eo  a:  g  co  0 

<1 

a  l  lt 

<  <3 

■D  G 

ac  G 

LLI 

0-0 

LL  -J 

► 

oc 

<1  <=J  *“■*• 

•  CL  C  G 

CL 

CO  0 

G  G  * 

(C 

C  G 

CO 

0 

O  <  ^ 1 

O  <3 

•  -J 

0  <  0 

00c 

'X 

Z  G 

< 

X 

2  -J  < 

Z  G 

Z  G  X 

Z  Z  X 

10 

G  C 

-J 

•— 1  G 

U/UU 

G  G 

<  «-<  CJ 

G  CX  G 

G  Z)  G 

a 

a 


< 

> 


o 

ec 


'•t  IT.  <c  s  .  ooo'0«-‘*Mfo^-uo<cr^coa'Or-ifMP04-if'%or^coo'0«-ucMfO«3-tri%or~ooovOi-(rMfo«i'iri>or^ooC'0^cMro^-ir\or^ooO'0 

•— «  •-«  — •  . — ■  iir-irjM(MMM(MMMNNrofororororo(OfO(0(OvJ4'^^>t<}-^>}^4in»f'ir.  (ririr>irioicir'C>D>CvO'C'C'C'0«c<CN 

^  9^  r— I  •— 4  f— 4  r— ♦  •— <  f— 4  H  *— 4  r-i  iH  »— 4  f-H  f-4  H  r“4  rH  4  0—4  vH  H  H  ^4  *—4  ^4  *=~)  •— 4  ^4  *-4  ^-4  r-4  ^*4  r4  f— 4  c=4  H  H  ^4  H  H  »*4  r—4  1 


-105- 


UJ 


Of 

Cj 

z 

UJ 

z 

a : 

x 


Lb 

a 

o 

u 

oo 


(C 

< 


< 

< 


x 

c 

£E 


X  o 
Lb  Z 
Z  UJ 


> 

o 

o 

cc 


Of 

cc 


u 

o'. 


a 

a 

3 


Cl 

z 

c 

CD 


< 

Lb 


D 

Z 


oo 

C 

C 

X 

l_J  t— 
X 


x 

z 


ro 

X 

o 

z 

< 

Q 

cc 


c 

z 

c 

CD 


H-  t—  UJ 

XX  CC 

u  id  *3 


co 

< 


z  z 

a  ol 
X  X 


or  a: 


X 

I- 

d 

z 

lb 


-  X  I-  — 
X  3 

Lb  Of 
Z  UJ 


d 

UJ 


<1 

(Si  (_' 


_l  O0 

X  •—  uj 
CD  oO 

-  X 

X  C  X 
Z  Z  X 

w  ro 
o 

CD 


u  u 
O  oo 
LU  *— 
t-  3 
Z  Of  lb 

•—  U - i 

X  X  00  Lb 
I—  Of  ■—  OO 

Cj  X 

O  •  X 
Z  Lb  X 
Lb  a  o 
> 


<  «&  X  Z 
X  — 
•  •  t_>  — 

o  •- 

Lb  <  » 

X  X  (— 

►-  _J  •— 

Lb  u  m 


CD 

«1 


X 

►- 

o 


of 

X  o 

t-  z 


lb 

< 


c 

Cl  z 

a 

z 

»— 

UJ 

UJ 

►— 

m 

cr. 

z 

U- 

z 

z 

X 

z 

h- 

c 

a 

CL 

LL 

X 

< 

cr 

OO 

— ' 

X  (V 

LL 

1 

< 

X 

z 

> 

> 

CC 

CL 

U) 

X 

h- 

*"■ 

c 

c 

o 

ir, 

< 

i- 

Uj 

lb 

k- 

K 

1 

> 

z 

u 

X 

k- 

c 

CD 

CD  I 

* 

00 

Z 

1 

1 

< 

k- 

< 

1 

UJ 

ll 

X 

1 

1  CJ 

3 

Z 

c 

I- 

1 

QC 

LL 

<r 

2 

cr 

o 

or 

o  z 

Of 

< 

X 

Z 

Ub 

►— 

c 

on 

m 

lb 

z  < 

lb’ 

X 

a. 

X 

X 

C 

< 

►— 

O 

UJ 

- 

3 

U  cc 

X 

Uj 

1 

o 

_ 

X 

LL 

O 

< 

on 

u 

o 

X 

1  CD 

K 

UO 

lb 

K 

z 

LL 

IL 

CL 

1 

on 

X 

X 

•— 1 

*r 

z 

<  - 

X 

K 

»— 

> 

Uj 

<1 

1 

1 

LU 

< 

UJ 

CL 

UJ 

X 

c 

X 

1 

m 

X 

5 

a 

a: 

< 

< 

UJ 

C 

X 

w 

Uj 

z 

cr 

— 1 

X 

< 

LL 

a 

K 

a: 

m 

on 

UJ 

CL 

m 

O 

< 

LL 

CD 

m 

m  -j 

z 

LL 

< 

C 

•— »  < 

u 

a 

U. 

H-  Z 
U>  •— 

c 

Of 

<1 

X 


Z  (SI 
Of 

X  X 
K  U 
Lb  Z 
Of  < 
CL 

•  a. 

C 

z  - 

•—  Of 
>-  UJ 
Of  I— 
C  Lb 

>  X 
-  < 

Of 

>  <3 
Lb  C. 
CL  I 

>  Of 
►—  UJ 

I  H- 
a  cj 

Lb  < 
»-  Of 
U  C 
<3  X 
Of  u 
<r  l 
X  sf 
U)  o 
I  Lb 

<  X 

•  lb 

I 

*3  < 


< 

* 


c 

u 

u> 

c 

Of 

O. 


< 

u 


3 

UJ 


.  D 

Z  <1 
X  UJ 


ao 

< 


o. 

> 


Of 

< 


I  OO 


X 

of 

a 

z 


o 

UJ 

X 

I 

U- 

Of 


o  o 

LU  < 
O  Lb 

C  X 

Of  I 
Q.  Lb 

-  a 

— ' 

_j  x 

_J  UJ 

<t  o 
u  o 

Of 

a 


OO  UJ 

C  cc 

c  < 

X  -J 
o  — 

• 

•  00 
LU  CO 

a.  uj 
X  n- 
CJ  Lb 
o'  Z 
I  <3 
3  CO 

lb  <L 

z  o. 

I  I 

<  _J 

-  < 

s. 

<.  CL 

x 


z 

a 

X 

t- 

UJ 

Of 


Of 

X 

t- 

UJ 

Of 


u 


OO 

z 

Of 


o  x 

<  z 

UJ  Lb 

I 

|(M 
lb 

of  X 

X  O 

x  z 

lb  < 
CJ  Of 
C  CD 
CC 

C_  Lb 
|oO 

<  ■— 
*  3 

CL 

(Si  «3  lb 


Cl  _J 
lb  < 

CJ  CJ 

a  — 

CO 

a 


00  Qf 

o  - 

o 

X 

CJ 


X 

CJ 

z 

< 

a 

cc 


oO 


O  t—  — 


CO 


I/O 

Of 

u- 

»- 

Ui 


CD 

< 


X  3  < 

Lb  Of  Lb  Cc 
Z  lb  _J  < 

x  x  a 

>-  CL  I 
Cj  _j 
X  < 
Z  X 

lb  Of 

o 


_J  OO 

Lb  X 

<e  a 

<.  X 

—  CJ 


Z  IS 

UJ 

o 


(S:  po  •J'  ir  vO 
r~-  r~- 


a: 


J'0-iM»'i-fK'CNa30'ObN(04t(l<C 
r~  (>■-  ao  cc  oc.  cc  cceocmoca'C'O'O'DC'O'D 


N  CO  O'  O  h 
O'  O'  O'  o  o 


fNj 

m 

IP 

>o 

r- 

CL 

O' 

o 

CM 

m 

IT 

o 

r>- 

CD 

a 

O 

rg 

CP 

IP 

vC 

f- 

CC 

a 

c 

O 

o 

O 

o 

c 

o 

O 

o 

pH 

•- 1 

CM 

CM 

CM 

CM 

fM 

CM 

(S 

CM 

PM 

rg 

po 

OsJ 

rg 

<NI 

(Si 

fg 

fM 

(SI 

CNI 

CM 

CM 

CM 

CM 

CM 

CM 

(SI 

fM 

CM 

CM 

CM 

CM 

CM 

CM 

CM 

(SI 

CM 

rg 

rg 

(s 

-106- 


-j 

UJ 

S3 

•Si 

-J 

• 

t- 

O 

o. 

z 

LU 

CL 

O 

r* 

C/0 

as 

X 

3 

u 

CJO 

z 

1 

h* 

< 

O 

X 

CL 

z 

UJ 

Oj 

< 

z 

X 

m 

1 

z 

o 

»- 

CL 

2 

u. 

3 

UJ 

K- 

_ l 

UJ 

cr 

1 

CL 

< 

< 

> 

Ci 

m 

• 

LU 

«* 

z 

¥- 

on 

LU 

«a 

Z 

o 

• 

UJ  3 

o 

LU 

r"J 

K 

X  z 

X 

»— 

CL 

X 

*- 

LU  U 

LJ 

X 

t- 

3 

X 

Ui 

»- 

OO 

1- 

LU 

X 

o 

u 

z 

X 

CW8 

<  — 

• 

2 

LU 

LU 

z 

k- 

UJ 

-J 

K 

2 

z 

O 

< 

X 

z 

z 

8 

U0  X 

o 

o 

cr 

V- 

h 

LU 

CL 

lUJ 

►— 

— J 

r—t 

a: 

CD 

X 

X 

z 

3 

z 

a 

»-  Z 

IT) 

X 

X 

LU 

CL 

Ui 

UJ  K 

H 

CL 

Z  <3. 

on 

LU 

-J 

u 

CD 

X 

1 

LU 

z 

2  X 

X 

z 

LU 

3 

a 

LU  CL 

UJ 

Z  1- 

LU 

2: 

<1 

a 

z 

UJ 

LU 

CL 

CL 

*- 

LJ 

X  CD 

a 

X 

CD 

< 

-J 

z 

UJ 

•— 1 

z 

z  z  z 

3 

UJ 

OO 

z 

a. 

Z  UJ 

<1 

a 

«—• 

< 

►*» 

CL 

Cd 

k- 

m 

CL 

CD 

o  - 

X 

CL  Z 

CD 

• 

a. 

o 

1 

3 

3  2 

Z  U. 

K 

3 

—  CL 

UJ 

3 

X 

az 

< 

3 

t- 

l—  tt 

O' 

CL 

Z 

• 

to 

C/0  < 

8 

1-  — 

• 

» 

h- 

CL 

LU 

LU 

ID  3 

LU 

»- 

• 

CD  _J 

K 

Lu 

tu 

UJ 

o 

-  c 

<1 

Z 

CL 

Cl.  h- 

h~ 

m 

X 

Z 

<L  <.  • 

CL 

CL  I 

a: 

0L 

2 

UJ  Z 

X 

UJ 

Ui 

k- 

UJ 

UJ 

-J 

lo 

-J 

•*  (_J 

3 

3 

LU 

(X  LU 

o 

• 

m 

•  QC 

CL 

Z 

t— 

X 

-J 

<  C/0 

££ 

Uj 

—  Z 

c, 

Q 

-J 

3 

J 

H- 

V- 

H 

LU 

<1 

UJ 

< 

-  1 

Lj 

CG 

-  < 

LU 

IU 

1 

O  CsJ 

< 

h 

2 

2 

2  • 

m 

X 

K 

K 

o 

a 

on 

< 

h-  a. 

u 

LU 

k— 

LU 

• 

X 

LL 

UJ 

UJ 

►“ 

UJ 

«/l 

< 

<  z 

CD 

-J 

CD 

o 

o 

»— * 

U  X 

LU 

s: 

X 

X  2 

2 

H 

1 

*-  c 

•> 

••  ^ 

3 

a: 

Cl 

CD 

C  (_J 

< 

2 

LU 

LU 

LU  UJ 

LU 

< 

Z 

uo  z 

> 

-  X 

00 

• 

— 1  •» 

a. 

CL 

1 

a.  Z 

h- 

K 

h-  X 

X 

k- 

CL 

1  LU 

< 

Z  1 

1 

o 

Q_ 

1  UJ 

1 

1 

< 

a  < 

a 

2 

< 

UJ 

< 

C  UJ 

LU 

I/O 

»- 

00 

CL 

O  1- 

< 

z 

*—■ 

t-  o 

Q 

»- 

• 

1  a 

UJ 

a 

h 

to 

»- 

H-  K 

L- 

II- 

Z  r-c 

• 

CL 

•—  u. 

UJ 

3  < 

LU 

< 

t-  cr 

e 

— N 

1/) 

a 

on 

on  < 

< 

-J 

UJ 

LL 

<1 

c/o  ui 

on 

a.  a 

X 

o 

<1 

t— 4 

UJ 

►— 

o 

1 

1  ►— 

►— 

•J 

oc 

-J 

X  X 

1 

U0  _J 

<; 

z 

1 

-  1 

K 

_J 

CD  LU 

►— 

LU 

X 

K 

K  00 

00 

< 

LU 

Z  C 

1- 

o 

UJ  1 

CL 

H 

t- 

u. 

X 

U 

CL 

ll/O 

2 

a 

-J 

u 

LU 

3  1 

| 

LJ 

CD 

o  z 

2 

z 

CL  < 

m 

*- 

J  3 

1 

LU 

I 

UJ 

<1  •— 

M- • 

on 

-J 

O 

Cl  LL 

o 

» 

-J 

< 

•—  <s 

U- 

c 

C-  - 

on 

2 

X 

f- 

a 

-J  CL 

< 

Z 

<£ 

UJ 

O 

•  X 

w 

h- 

<1 

m 

* 

m  •— « 

o 

•J 

-J 

LO  CL 

X 

X 

X 

t-4 

C 

Ui 

yj 

OO 

8 

<  1 

m 

• 

on 

UJ 

CL 

•> 

2 

O 

►— 

X 

•» 

• 

-J 

< 

c/0  CD 

UJ 

1 

LU  <L 

►— 

z 

a: 

<3 

o  <1 

z 

C 

h-  C\J 

«  IT 

QL 

m 

UJ 

2 

U1 

-J 

<=J 

-J 

LJ 

a 

< 

♦— 

t— 

e» 

» 

on 

» 

—  - 

<x 

a: 

< 

a 

2 

X 

U_ 

h- 

O 

LU 

LU 

2 

— J 

-J  -J 

u 

<1 

LU 

*  J- 

U- 

Oj 

h- 

on 

LU 

• 

m 

3 

X 

►— * 

K 

H- 

00 

X 

on 

on 

2 

< 

<  -J 

-U 

z 

LU 

— j  to 

on 

LU 

on 

H- 

<1 

t-  < 

m 

h- 

«* 

LJ 

w 

o  t- 

LJ 

•— * 

UJ 

•— < 

♦— 

LU 

2 

u 

u  c 

< 

z 

LL 

_J  LU 

on 

-J 

«=J  •— 

*— • 

esc: 

I 

e=-» 

2 

UO 

c 

UJ 

X 

<. 

-J 

jt 

H- 

CL 

o 

o 

» 

CL 

_J  o 

| 

1 

<1  X 

a 

o 

LU 

»—  «> 

LU 

a. 

u. 

Ol  UJ 

1 

• 

a  lu 

• 

< 

• 

<J 

3 

«I  •— 

UJ  K 

<1 

U  CL  UJ 

1 

X 

z 

C£  LU 

X 

CL 

-J  UJ 

CD 

z 

CJ 

m 

• 

m 

X 

LU 

Ci 

o 

X 

<i 

►— i 

G_ 

Z 

o 

— 

a 

IX 

lb 

<— j 

o 

Cl 

k-  o 

• 

er> 

o 

a: 

X 

C  UJ 

cc 

o 

Cj 

UJ 

• 

c.  a  c 

o  o 

cr 

a 

OO  O 

2 

3 

CJ 

Z  1- 

<1 

X 

z 

»- 

z  — 

I 

z 

l/) 

<1 

<. 

z 

IX 

LU 

h- 

UJ  c 

-J 

H 

o 

LL 

< 

< 

LL  in 

O 

UJ 

cc 

=J 

7—4 

o 

UJ 

1-  u> 

LU 

y~ 

k- 

OO 

3 

3 

CL 

00 

00 

<1 

OO 

a 

m 

• 

«> 

• 

®> 

m 

> 

in 

< 

GO 

O' 

o 

C\J 

ro 

•4- 

in 

<£> 

an 

O' 

o  — 

CNJ 

in 

O  f- 

00 

O' 

o 

^■4 

CNJ 

m 

in 

•£> 

r- 

CD 

o- 

o 

•-»  ro 

m 

<4- 

in 

<o 

00 

O' 

o 

rJ 

ro 

rr 

in 

<3  t~ 

CO 

O'  o 

m 

cr> 

ro 

ro 

m 

•4- 

•4- 

•4- 

>4- 

>4- 

•4- 

in  in 

in 

IT 

in 

ITi 

in 

ir. 

in 

in 

v0 

>o 

sO 

s0 

•o 

•D 

-C 

vD 

v0 

•o 

r- 

r-  r- 

r- 

r- 

r- 

r- 

r- 

k- 

00 

cc 

CD 

CD 

oc 

CC 

00  00 

CD 

CD  O' 

O' 

<\» 

fSi 

(\J 

(\j 

CNJ 

<\J 

<\i 

<\i 

C\i 

(\J 

ro 

CM 

ro 

ro 

ro 

ro  ro 

ro 

ro 

CM 

CM 

ro 

CM 

ro 

ro 

ro 

ro 

ro 

ro 

ro 

rsi 

ro 

ro 

ro 

ro 

CM 

CM  CM 

CM 

ro 

ro 

ro 

C\j 

CM 

CM 

ro 

ro 

CNJ 

CM 

ro 

ro 

<M  CM 

ro 

ro  rj 

ro 

107 


I 


UJ 

03 

< 


a 

<_> 

to 

<D 

X 

to 


<i 

X 


03 

< 


to 

O 

O 

X 

o 


to 

to 


X 

UJ 

2 

2 

ct 

X 


to 


ct 

u 

to 

cc 

X 

to 


<J 

u 


UJ 

to 


o 

2 

UJ 


f«0 

1 
o 

2 
< 
ct 
CD 


Ct 

« 

_J 

<3 

<_) 

to 

I 

o 

2 


UJ 
UJ  CD 

to  < 
O  -J 
D  — 


X 

UJ 

2 

2 

Ct 

X 

►— 

UJ 

ac 


UJ 

to 

C 

O 

X 


a 

UJ 


to 

to 

UJ 

ct 

O. 

X 


2 

Ct 

X 

►— 

UJ 

ft 


2 

C 

to 

to 

UJ 


C 

2 

UJ 


a: 

cl 

m 

o 

2 

<1 

X  - 

_i 

o 

< 

QL 

LJ 

CL 

-j 

• 

> 

o 

QC 

X 

o  t- 

2 

a 

CL 

2 

X 

UJ 

< 

I 

X 

1 

to 

< 

UJ 

< 

X 

< 

LU 

CD 

<=* 

a: 

o 

X 

K 

b- 

•  ' 

«j 

a 

UJ 

ct 

1 

< 

K 

CL 

LU 

UJ 

LL 

UJ  _l 

i 

rg 

CD 

K 

-J 

to 

< 

«- 

2 

or 

UJ 

a  | 

Uj 

o 

a 

x 

o 

1 

X 

— i 

o 

X  ct 

Ct 

X 

o 

-J 

- 

2 

CL 

• 

-U 

2 

c 

UJ 

r-4 

LL 

1 

2 

o  uj 

X 

o 

2 

UJ 

-J 

2 

LL 

1 

C 

Cl 

|U 

2 

h- 

l/) 

<1 

UJ 

UJ  t- 

o 

2 

UJ 

1 

c 

< 

ct 

< 

2 

k- 

< 

mm 

X 

•— « 

• 

CJ  UJ 

UJ 

< 

< 

2 

o 

X 

r- • 

•» 

UJ 

it 

UJ 

rH 

X 

lu 

(Nl 

CO  X 

u 

cc 

r-H 

m 

LL 

►- 

to 

c 

| 

K 

2 

a. 

<1 

ct  < 

c 

CD 

» 

UJ 

X 

< 

2 

1 

•» 

X 

-J 

b- 

LU 

< 

UJ 

X 

-J 

C.  Ct 

ct 

X 

< 

2 

2 

ct 

LJ 

Ct 

K 

o 

UJ 

LL 

O  ct 

X 

LU 

o 

LL 

1  < 

a 

UJ 

o 

CL 

i r. 

CL 

K 

2 

• 

H 

X) 

U 

-J 

2 

CL 

UJ 

let. 

»— 

to 

2 

CD 

C  CL 

1 

to 

2 

• 

h- 

*~^ 

X 

< 

<< 

2 

X 

I'¬ 

e 

_J 

< 

«Q 

mJ 

<x 

► 

o  I- 

*— • 

< 

to 

< 

UJ  - 

< 

►-* 

«a 

oo 

2 

X 

K 

b- 

LU 

m 

CL 

c 

Uj 

Ll 

m 

1 

«3 

(X 

mJ 

I 

» 

X 

ct 

-J 

-i 

m 

3 

CL 

*rnm 

Q  UJ 

LL 

• 

LU 

2 

CL 

*— * 

CL 

*— i 

< 

cj 

CL 

< 

LU 

CL 

CD 

-J  -1 

ct 

d 

2 

OC 

K 

a 

< 

CO 

U~' 

CC 

« 

<1 

2  U: 

oc 

c  _l 

CM 

< 

Uj 

m 

tn 

2 

1 

2 

ID 

to 

UJ 

• 

• 

LU 

LU 

LU 

X 

H- 

►— 

UJ 

O  <1 

X 

t- 

b- 

IO 

r-H 

LL 

LL 

LU 

CL 

LL 

lo 

UJ 

U) 

f- 

< 

►- 

<S) 

»— • 

to 

< 

f- 

2 

2 

•-* 

1  u 

>- 

2 

LO 

LU 

IO 

X 

CL 

to 

• 

*— « 

C/O 

a 

X 

** 

2 

to 

pm* 

(/) 

LL 

o 

w 

c 

h- 

LU 

UJ 

LU 

<  — 

C' 

u- 

UJ 

CL 

X 

►— < 

UJ 

CL 

b- 

mm 

►- 

•— 

a 

o 

IS 

LU 

• 

1— * 

J? 

►— » 

*— 

D 

X 

O 

X 

X 

a 

— J 

CL 

o 

K 

8 

LU 

3 

2 

Q. 

Uj 

X 

2: 

CL 

UJ 

2 

CL 

-J 

a 

UJ 

h- 

X 

Uj 

•— i 

UJ 

LU 

b- 

LL: 

► —  Uj 

1 

X 

2 

Cl 

UJ 

<1 

<r 

CL 

Ct 

Uj 

LU 

UJ 

— j 

• 

LL 

< 

LL 

_J 

L> 

*— « 

Uj 

-j 

2L 

rv 

o 

2 

1  -J 

►- 

2 

< 

2 

l-J 

CL 

LU 

< 

LL 

-J 

H- 

• 

•  r 

Uj 

-J 

X 

T" 

— j 

» 

cr 

X 

X 

K 

UJ 

X 

X  t- 

UJ 

LU 

Cj 

X 

< 

LU 

UJ 

O  X 

LU 

• 

CL 

-T~ 

X 

to 

LU 

X 

“™N 

LL 

b~ 

a 

to 

—1 

03 

b- 

ct 

< 

LO 

L/~> 

b- 

a. 

to 

mJ 

to 

C 

■* 

< 

CL 

to 

o 

<  Ct 

►— 

X 

P 

oc 

1  to  « 

h- 

a 

h- 

C 

>  > 

LL 

o 

H 

o 

Cj 

LU 

c_. 

►-* 

CD 

l/) 

a 

cc 

LU 

LU 

-J 

•• 

Cj 

2 

o 

o 

<1 

C 

CC 

-J 

o 

to  Q 

D  _l 

CD 

O 

o 

IO 

c 

X 

CD 

-J 

o 

CL 

o 

o 

b- 

1  *3. 

VI 

2 

IX 

2 

l< 

X 

2 

_J 

X 

2 

< 

<1 

2 

X 

X 

2 

U' 

K 

_ 1 

r— ^ 

o 

UJ 

o 

UJ 

b- 

■mi 

-4  O 

UJ 

-J 

o 

UJ 

CL 

=J 

r—* 

O 

Uj 

h- 

<_j 

UJ 

1 

X 

Uj 

Uj 

< 

<1 

LU 

LL 

a 

O 

O 

lm) 

Ql 

a 

►—« 

• 

" 

m 

m 

m 

* 

r\j 

ro 

<4- 

in 

-c 

n- 

a 

O' 

o 

CM 

m 

in 

t- 

CO 

O' 

o 

r-J 

eg 

ro 

*a- 

in 

v0 

r- 

00 

O' 

o 

cm  ro 

in 

o 

00 

O'  o 

CM 

ro 

>J-  in  «c 

f- 

CD  O' 

o 

f—i 

CM 

ro 

>t  in 

>o 

r- 

03 

0- 

O  r- 

a 

O' 

O' 

O' 

O' 

(X 

O' 

O' 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

r—4 

r-J 

*— < 

r-J 

«— < 

r— « 

m-4 

CM 

rv 

CM 

(M 

cm 

<M 

(Nj 

CM 

rvj 

rj  ro 

ro 

ro 

ro 

ro  ro 

ro 

ro 

ro 

>3 

>* 

>d- 

»»■ 

'j- 

** 

■4- 

m  m 

cm 

<N 

C\l 

(M 

rsj 

CM 

CM 

ro 

ro 

co 

ro 

ro 

ro 

ro 

ro 

ro 

ro 

CO 

ro 

ro 

ro 

ro 

ro 

m 

m 

ro 

ro 

rr, 

ro 

m 

m 

m 

ro 

n 

ro 

r-  ro 

fO 

ro 

n 

ro 

ro  ro 

ro 

ro 

co 

ro 

ro 

ro 

ro 

ro 

ro 

ro 

ro 

ro 

ff. 

ro 

r. 

-108- 


UJ 

IP  4- 

o  x 

O  UJ 

X  z 

u 

z 

UJ  or 
Z  X) 


_J  UJ 

I  a 

2 

UJ  - 
Z  UJ 
tp 
4-  — J 
Z  UJ 
UJ  I 
O  U. 
X  >— 
UJ  I 

a 

-  z 

t-  UJ 
Z  I 

UJ  «I 
2-  - 
UJ 

I-  < 
<1 

4-  t- 
to  Z 

-  UJ 

g 

_J  X 
-J  LL 


-  I 

LL 


E 

<. 


X 

LLi 

z 

z 

a: 

3 


< 

G 


X 

UJ 

z 


X 

G 

z 

< 

a: 

CD 


o 

<1 


UJ 

to 

O 

o 

X 

g 


X 

LJ 

U. 

X 

g 

I 

o 

Q 


G  x  « 
ll  or 


C  - 

l  z 


X 

UJ 


a 

z 


i 
t-  g 
X  z 


<1 

X 

- 

X 

X 

UJ  < 

G  - 

3 

o 

I 

• 

t-  g 

Z  CC 

H- 

1 

c 

o 

z 

X  z 

IE 

LL  Z 

4- 

1 

c 

o 

UJ  < 

9-4 

Z  U 

to 

Q 

1 

•— 

Z  a 

•> 

—  X 

r. 

Z 

< 

LO 

e 

X  Z 

-J  U 

a 

Ll 

IP 

G  < 

1 4- 

i 

1 

UJ 

• 

Z  X 

H* 

3  <r 

< 

< 

c 

cr 

x  z 

<  4- 

X 

LL  4- 

m 

Cl 

g  < 

a  1 

LU 

Z  IP 

» 

X 

Z  X 

E  0C 

z 

— 

< 

<1 

LU 

ui  4- 

UU 

<  4- 

LL 

t- 

on 

-  X 

on 

I- 

CL  1 

-  4- 

H 

Z  _J 

• 

m 

3 

U. 

O 

X 

IE  IP 

_J  < 

U.  _J 

Z 

b- 

< 

-J  Z 

O 

4- 

UJ 

IP 

«.  UJ 

X 

c  < 

C 

on 

•J 

_J 

I 

X 

z 

-  LL 

X  CL 

i_ 

z  g  c 

»— « 

<1  z 

uj 

X 

UJ 

-J  -J 

G  O 

z 

—  z 

on 

1 

G  a: 

LU 

Z 

f— • 

«  1 

LL  | 

< 

UJ  Ll 

on 

I 

Z 

X 

m 

z 

X  < 

l< 

Cl 

-  z 

LU 

h- 

o 

-  K 

t- 

H 

X 

c  » 

a  * 

C 

z  —  z 

QL 

z 

kM 

C.  LL 

V) 

CJ 

Uj 

LL 

LL  _J  CC 

a 

LU 

U 

4-  a 

X 

z 

1  «l 

4-  < 

• 

X  1  X 

X 

s: 

< 

>  a 

_j 

X 

o 

<1 

IP 

< 

G 

h  3  1 — 

UJ 

Uu 

QL 

c 

-  z 

1 

g 

z 

a 

IP  — 

LL  • 

<x 

-  UJ  u  • 

m 

K- 

LU 

z 

-  >-  UJ 

z 

z 

< 

CD 

LU  >- 

a  > 

Z! 

z  or 

<■ 

►- 

UJ 

x  cr 

o 

< 

a 

, 

-J  ct 

L3  a 

O 

» 

— J 

LJ  1  Z 

M 

OT 

CD 

«> 

1  < 

l< 

LL’ 

C  4-  - 

-J 

00 

• 

uj  <  or 

on 

E 

_j 

<r  ►- 

<  — 

1 

<  z  z 

< 

• 

X  -  X 

on 

m 

c 

-  4- 

-  4- 

4- 

IL  U-  U. 

uj 

-J 

X 

LJ  4- 

LLj 

• 

c 

*  ' 

ec 

CL 

Cj 

X  O  X 

-J 

G 

I  <1  UJ 

a: 

or 

z 

c 

<  LL 

<  LL 

Z 

12  4- 

m 

-J 

<  K 

z 

CJ  CC 

Q. 

C 

UJ 

4- 

(- 

1 

1 

» 

►— 

• 

z 

CD 

< 

o 

V 

i 

a 

O 

m 

o 

cc 

-J 

CL 

-J 

G 

< 

U_ 

1 

X 

UJ 

LU 

►— « 

E 

<i 

z 

Z 

z 

< 

-J 

< 

-J 

<  Z 

on 

O 

3 

z 

-J 

f\J 

on 

1 

• 

UJ 

«j 

< 

u 

=J 

<i 

E 

-J 

*— 4 

< 

►— > 

< 

UJ 

UJ 

-j 

z 

1 

►— J 

CC 

c 

< 

on 

O 

UJ 

UJ 

UJ 

K 

UJ 

b- 

UJ 

• 

cc 

< 

LU 

LU 

Ll 

— J 

-J 

< 

X 

G 

UJ 

z 

<. 

z 

cc 

• 

z 

cc 

• 

z 

CD 

• 

CL 

cc 

>  z 

< 

1 

1 

or 

1  UJ 

9— 

LJ 

1 

on 

• 

Of 

1 

cr 

< 

> 

a 

< 

> 

or 

< 

> 

IL 

Uj 

LL 

li¬ 

QC  CC 

-J 

• 

LU 

<■ 

a 

3 

CD 

CL 

u 

< 

*— • 

2. 

CL 

< 

00 

m 

h- 

X 

-J 

CL 

3 

a 

X 

<-J 

CL 

H 

on 

K 

on 

<L  3 

z 

►— 

1 

LU 

< 

< 

<. 

C 

X 

z 

X 

H* 

< 

►— 

«a 

H 

< 

• 

•» 

»—  H- 

o 

o 

1 

<i 

Z 

> 

QC 

» 

t— < 

UJ 

o 

Uj 

UJ 

«• 

K~ 

LL 

cc 

Uj 

m 

3 

3 

b~  UJ 

> 

*— • 

< 

<r 

on 

m 

1  cc 

<1 

LU 

on 

•> 

< 

Z 

QC 

> 

Z 

CC 

>• 

a 

cr 

> 

K 

CC 

oJ 

a 

a  cl 

a 

to 

• 

►— 

»— 

O 

r 

on 

on 

on 

CL 

»— • 

cr" 

< 

CL 

CL 

u 

-J 

LU 

u 

<T 

on 

Uj 

< 

z 

u. 

G 

Lu 

3 

U- 

-J 

LU 

on 

00 

LU 

00 

< 

X 

UJ 

< 

* 

UJ 

< 

LU 

< 

X 

< 

X 

b-  LU 

Q 

LU 

< 

on 

» 

w 

LL 

►— 

1  z 

on 

< 

CL 

-J 

LT> 

*— 

UJ 

to 

►—< 

h- 

o 

00 

CE 

G 

ip 

►—4 

I- 

UJ 

1- 

UJ 

H- 

•  to 

Z 

cr: 

h~ 

« 

Cj 

LU 

< 

LL' 

=u 

CL 

< 

9—» 

— j 

o: 

X 

*— i 

z 

►—4 

on 

O 

• 

►-j 

to 

» 

UJ 

II 

O  (- 

UJ 

II 

a 

i- 

»— H 

IP 

o 

CL 

» 

3: 

z 

LU 

z 

m 

O 

3 

o 

X 

UJ 

3 

1 

cl 

UJ 

X 

3 

«5 

3 

cr 

ip 

X 

IP 

X 

3 

•-> 

G 

X 

Uj 

QC 

LL 

LU 

•— * 

h 

^-4 

Cl 

UJ 

1  Ui 

QC 

U: 

Z 

X 

z 

CL 

UJ 

Z 

X 

— J 

Q 

UJ 

X 

_J 

or 

LU 

LU 

— J 

c 

u 

o 

LL 

-J  CL 

UJ 

U. 

u- 

o'; 

LU 

_ i 

Z 

r-4  •  r 

Z  (NJ 

lu. 

z 

m 

LU 

-J 

o 

Lu 

<1 

LU 

-J 

C 

e 

-J 

Ui 

-J 

• 

o 

— J 

UJ 

— J 

• 

K- 

-J 

Ci 

z 

c 

z 

<3.  UJ 

_ 1 

m 

IP 

i* 

-J 

X 

X 

L_ 

LL 

*7~ 

LU 

c 

X 

3 

o 

> 

X 

X 

• 

cr 

X 

X 

*— • 

•» 

u  < 

X 

3 

> 

• 

LJ 

< 

X 

X 

>- 

m 

UI 

< 

X 

X 

UJ  X 

X 

>■ 

LU 

1— 

QC 

h~ 

IP 

O 

< 

K 

a: 

— 1 

cr 

K— 

CjL 

on 

— j 

cr 

K 

cc 

to 

on  uj 

y- 

CL 

CL 

IP 

UJ 

H- 

CL 

cc 

00 

LJ 

o 

G 

II  K 

or 

CC 

c 

<. 

(_ 

►— i 

cu 

Cj 

H 

u 

m 

c 

on 

LU 

U 

o 

O’/ 

C  — 

Cj 

<1 

— 1 

Cj 

to 

O 

< 

—J 

o 

II 

V 

A 

r  o 

< 

«U 

c 

»- 

l; 

o 

<y 

< 

O 

UJ 

CD 

-J 

a 

UJ 

o 

e 

h- 

=J 

O 

c 

CL 

-J 

o 

o 

*— ) 

— 1 

< 

z 

on 

I 

z 

a 

o 

cc 

<. 

<1 

z 

a 

< 

X 

z 

z 

<1 

X 

Z 

O 

< 

X 

z 

(- 

<. 

u 

Lb 

1  G 

UJ 

UJ 

n 

UJ 

a 

=J 

^  O 

LU 

a 

UJ  H 

u: 

LU 

►—4 

UJ 

u 

UJ 

<1 

UJ 

H  UJ 

UJ 

or 

U' 

(Njm^-ir.'Cr-cDO'o 
in  in  ip  ir,  ip  ip  ip  ip 
mm  noor  (iior 


UJ 


f-4 

rsj 

m 

in 

-o 

CD 

a 

o 

9—4 

»\i 

m 

«*■ 

in 

r- 

CD 

O' 

o 

(M 

cr 

•4- 

in 

vC 

r- 

CD 

O' 

o 

CVJ 

«4 

in 

vC 

r~ 

CO 

o 

o 

»— 4 

(M 

m 

>4 

in 

vC 

c- 

co 

G 

O 

o 

>c 

s0 

vC' 

o 

vC 

o 

v0 

r- 

b- 

r- 

r- 

r- 

h- 

f- 

r- 

r^- 

o- 

E 

ac 

E 

CO 

cc 

cc 

CD 

CC 

O' 

cr 

a 

O' 

O' 

a 

O' 

O' 

O' 

O' 

o 

o 

O 

o 

O 

c 

O 

o 

o 

O 

rJ  — 1 

ro 

no 

cr 

m 

ro 

rh 

cr 

m 

co 

cr 

m 

ro 

PO 

cn 

m 

ro 

n 

cr 

m 

cr 

m 

cr 

cr 

9\ 

cr 

ro 

cn 

cn 

m 

m 

cn 

m 

cn 

cn 

fp 

■4- 

•4- 

>4- 

>4- 

<4- 

»4- 

<*• 

>4- 

>4-  ^4- 

-109- 


X 

UJ 

Z 

Z 

oc 

3 


CO 

CO 

¥ 


a 

« 

a 


x 

o 


x 

UJ 


CO 

C_i 

o 

X 


fr- 

z 

<r 

»- 

X 

«> 

K 

X 

UJ 

r— 

»— 

X 

*- 

3 

z 

X 

< 

UJ 

X 

Z 

X 

UJ 

3 

fr- 

z 

UJ 

►- 

z 

X 

X 

• 

Z 

z 

»- 

3 

X 

z 

Uj 

u 

z 

z 

QC 

X 

X 

u- 

X 

< 

»— * 

z 

3 

3 

z 

o 

QC 

z 

■“s 

3 

IV 

o 

Z 

o 

a 

X 

X 

K 

►— 

3 

QC 

3 

z 

z 

z 

z 

CD 

X 

UJ 

UJ 

X 

z 

lO 

v~ 

3 

►- 

3 

3 

< 

o 

z 

z 

LL 

tt 

iO 

UJ 

t- 

X 

QC 

z 

z 

tt 

m> 

z 

X 

Z 

X 

UJ 

QC 

UJ 

UJ 

QC 

QC 

z 

X 

a 

u 

<1 

<_> 

z 

UJ 

z 

h- 

c 

QC 

z 

» 

3 

a 

L. 

fr— 

Of 

z 

a 

CC 

cc 

z 

LU 

a 

• 

-J 

3 

3 

3 

• 

< 

cc 

< 

3 

<1 

3 

cr 

QC 

X 

cO 

» 

z  < 

3 

3 

3 

< 

h- 

z 

cc 

3 

-J 

3 

X 

UJ 

3 

l/) 

QC 

QC 

a 

QC 

3 

cc 

u 

UJ 

• 

ft 

UI 

*— 

3 

h- 

m 

• 

Z 

3  Uj 

QC 

cz 

< 

fr- 

X 

QC 

m 

QC 

LLi 

3 

4— 

3 

K 

fr- 

» 

m 

a: 

<r 

— J 

» 

fr- 

C 

QC 

4 

-J 

X 

CL 

LU 

3 

3 

» 

• 

fr¬ 

u; 

a 

UJ 

» 

to 

m 

z 

QC 

-J 

t 

I  QC  3 

< 

< 

3 

c, 

ee 

1 

*— * 

c 

-J 

•— 

UJ 

u 

m 

3 

<1 

>- 

> 

1 

QC 

or 

4 

c 

X 

< 

fr- 

*— • 

< 

-J 

3 

3 

K- 

u 

QC 

a 

*» 

tt 

3 

3 

QC 

< 

V) 

— 

-j 

> 

tt 

UJ  1 

cc 

fVJ 

< 

•— i 

4 

4 

K 

UJ 

3 

3 

3 

1 

1 

3 

*— « 

UJ 

to  2 

«3 

h- 

QC 

3 

m 

z 

Z 

O 

fr- 

1-4 

3 

<1 

< 

< 

X 

c 

fr— 

C  C 

M 

I 

X 

UJ 

1 

to 

3 

zz 

Z  3 

3 

3 

►— 

• 

• 

1 

1 

*— 4 

o  *- 

CL 

o 

LL 

»- 

QC 

| 

1 

1  4 

1 

1 

3 

• 

c 

< 

<T 

o 

-J 

X  to 

«X 

z 

z 

UJ 

to 

c 

< 

<1 

tt 

3 

3 

1 

< 

< 

z 

m 

• 

z 

1 

o  to 

> 

< 

3 

K 

UJ 

m 

• 

< 

4 

4 

3 

tt 

UJ 

UJ 

o 

UJ 

1 

oc 

1 

O 

X 

X 

3 

a 

*— i 

» 

< 

.J 

< 

< 

UJ 

•  CC 

o 

cc 

»- 

4 

h- 

4 

4 

< 

o 

3 

3 

CC 

> 

> 

o 

z 

UJ 

z 

X 

-J 

ll  a 

z 

X 

< 

QC 

z 

1 

3 

3 

1 

or 

tt 

r? 

a 

ce 

• 

or 

*— < 

LL) 

-J  X 

UJ 

UI 

o 

O 

4 

UJ 

m 

90 

<1 

1 

1 

< 

< 

c 

c 

< 

> 

>► 

3 

LL 

CC 

CD  UJ 

1 

CO 

z 

3 

X 

CL 

>- 

> 

> 

< 

4 

• 

c 

o 

fr- 

CL 

QC 

fr- 

1 

<1 

<  - 

4 

•—4 

<1 

u. 

3 

< 

QC 

tt 

CL 

• 

« 

z 

z 

UJ 

UJ 

< 

< 

LL. 

< 

-J 

» 

2 

a 

1 

1 

CL 

4 

< 

< 

< 

< 

o 

c 

to 

CL 

• 

X 

X 

CL 

CC  -J 

QC 

cc 

4 

4 

1 

X 

X 

X 

< 

4 

LJ 

lj 

• 

> 

*— 

►— * 

CL 

<  «=J 

rvi  < 

U_ 

m 

•• 

H 

►—4 

►— < 

o 

e> 

Uj 

LL 

lL> 

CL 

tt 

CL 

LL 

< 

UJ 

>  <3 

I 

K 

LL 

or 

O' 

tt 

z 

3 

3 

z 

to 

c o 

-J 

OO 

to 

< 

CL 

CL 

CO 

»— 1 

1  LJ 

1- 

Z 

4 

< 

UJ 

Cc 

CL 

CL 

*— < 

Z 

3 

*— i 

» 

• 

-J 

*— * 

X 

• 

«—■ 

CL 

LL 

<  w 

o 

fr- 

UJ 

_J 

» 

• 

» 

tt 

| 

| 

QC 

<r 

e— j 

to 

UJ 

m 

X 

O 

QC 

o 

fr— 

QC 

QC 

3 

— J 

— i 

L> 

tt 

Ui 

• 

a: 

-J 

— I 

a 

LL! 

o 

fr- 

U-1 

•— < 

UJ 

z 

<1 

3 

-J 

-J 

to 

3 

3 

t/J 

_) 

— J 

— 

U-. 

_ 1 

> 

Q- 

-J 

— 1 

Ui 

_ J 

UJ 

z 

1  CD 

►—« 

m 

3 

mJ 

L 

1 

co 

CD 

1 

< 

< 

L- 

3- 

a 

• 

Uj 

<1 

< 

X 

X 

• 

UJ 

1- 

UJ 

o 

X 

tt 

4 

<1 

< 

c 

T 

X 

3 

<w 

o 

< 

fr~ 

a: 

< 

to 

O 

LJ 

H 

tt 

> 

to 

z 

a 

c 

3 

fr- 

< 

O 

o 

o 

<1 

3 

3 

4—4 

4- 

♦ 

C- 

CJ 

— j 

O 

* 

V* 

Cj 

CL 

O 

•— < 

Oj 

Z 

to 

w- 

1 

4- 

r 

CL 

Z 

Z 

CD 

O 

z 

-j 

O 

c 

o 

O  <i 
O  (_) 


CO 


X 


X  X 
•—  <_> 
0C 

CL 


o 

z  < 

UJ  3 
< 
c 


f\j 

m 

LT 

40 

h- 

GO 

O' 

o 

0—4 

CsJ 

fO 

•a- 

in 

40 

r- 

oc 

O' 

O 

fM 

ro  4 

in 

4) 

oo 

O' 

O 

_4 

fM 

ro 

«4 

40 

3 

CD 

O' 

0—4 

*— < 

r—i 

0—4 

0—4 

fM 

<NJ 

r\j 

C\J 

(VI 

r\i 

fM 

C\i 

f\j 

OJ 

ro 

rr. 

m 

m  fo, 

ro 

ro 

m 

ro 

ro 

4* 

s4- 

44 

"t 

44 

44 

«4 

•4- 

44 

4t 

4 

*4* 

4 

4 

4“ 

sf 

4 

4- 

4- 

4- 

•4- 

4* 

4- 

4‘ 

>!• 

4 

4* 

4  <4- 

4* 

•4-  <4- 

4* 

44- 

44- 

<4- 

44- 

4t 

«4 

* 

44- 

44- 

44 

44 

-no- 


Appendix  B 


Special  Characters 


The  following  are  valid  as  special  characters: 


( 

) 

S 


< 

> 


+ 

* 

/ 

I  (or) 

•cat1  (catenate  bar:  the  keyword  is  used  in  the 
syntax  language.  In  programs  to  be  compiled, 
it  is  just  an  or  symbol,  but  it  must  be 
immediately  followed  by  another  or  symbol.) 


Ill 


Appendix  C 


Syntactic  Constants  Recognized  by  the  Scanner 


The  following  are  the  syntactic  token  classes  recognized 
by  the  Scanner.  Some  have  abbreviations  which  are  used  in 
the  syntax  language.  Definitions  of  these  terms  are  in 
[Holt  and  Wortman  1973  ]. 


IDENTIFIER 

INTEGER 

NUMBER 

STRING 

BIT_STRING 

BA.D_  IDENTIFIER 

BA.D_  INTEGER 

NUMBER_TOO_LARGE_ EXPONENT 
NU MB ER_TR UNCAT ED_F FACTION 
NU  MBER_TOO_LARGE_FRACTION 
NUMBER_NO_EXPONEN  T 
NUMBER_INCOMPLETE_ EXPONENT 
BAD  STRING 


BAD_IDENT 

BAD_INT 

NUMBER_TLE 

NUMBER_TF 

NUMBER_TLF 

NUMBER_NE 

NUMBER  IE 


By  changing  the  Scanner  and  using  some  of  these  keywords  to 
mean  different  things  than  they  do  in  SP/k,  other  token 


classes  (such  as  hexadecimal  numbers)  could  be  developed 


Appendix  D 


This 

Language 


The  Syntax  Language 


Appendix  contains  the  syntax  charts  and  Syntax 
description  for  the  Syntax  Language. 
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Appendix  E 


Organization  of  the  Table  Generator 


In  Chapter  7  we  discussed  how  to  use  the  Table  Generator 
to  produce  Parsers  for  various  languages.  This  appendix 
presents  the  internal  logic  of  the  Table  Generator. 

The  Table  Generator  has  its  own  copy  of  the  Syntax  Table 
which  it  fills  up  as  it  processes  the  description  of  the 
user's  language.  When  it  has  finished  processing  a 
description,  it  writes  out  a  copy  of  this  table  to  be  used 
in  the  Parser.  (When  we  mention  Syntax  Table  in  this 
appendix,  we  mean  the  Table  Generator's  copy.) 


The 

Table  Gener 

ator  is  basically 

an  assembl 

assembles 

the  Syntax 

Table  (Chapter  4) 

from  the 

Language 

of  Chapter 

3.  This  appendix 

discusses 

different 

information 

is  processed. 

er ;  it 
Syntax 
how  the 


For  the  purpose  of  differentiating  names,  we  prefix  the 
word  T  (for  token)  to  names  of  items  in  the  Syntax  Table; 
the  word  A  (for  action)  is  prefixed  to  semantic  identifiers. 
This  Appendix  refers  to  the  names  in  the  description  in 
Appendix  D. 
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E . 1  One-for-one  Insertion 

Some  tokens  of  the  user’s  description  simply  indicate 
that  one  specific  piece  of  information  should  be  inserted 
into  the  Syntax  Table.  This  is  done  using  a  vector  called 
Emit  Vector;  it  is  indexed  by  semantic  identifiers,  and 
contains  possible  Syntax  Table  entries.  There  are  four 
classes  of  semantic  actions  which  cause  use  of  Emit  Vector. 
These  are: 

-  A  Return  which  causes  insertion  of  T  End  Production 

-  special  character  actions  -  for  example  the  action 

indicator  A  Comma  causes  insertion  of  T  Comma 

-  paragraphing  token  actions  -  for  example  A  Indent 

causes  insertion  of  T  Indent 

-  constants  -  these  are  the  syntactic  token  classes  that 

are  recognized  by  the  Scanner.  For  Example,  A  Bit 

String  causes  insertion  of  T  Bit  String. 

E.2  Keywords  and  Semantic  Identifiers 

Keywords  and  semantic  identifiers  are  encoded  as  strings 
in  the  Syntax  Language,  and  the  Parser  passes  these  strings 
to  the  Table  Generator.  Immediately  following  the  string, 
an  encoded  semantic  identifier  is  sent  to  the  Table 
Generator;  for  these  two  token  classes  the  identifiers  are  A 
Terminal  String  and  A  Semantics,  respectively. 
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keywords 


and  semantic 


In 


the  Table  Generator, 


identifiers  are  saved  in  two  vectors.  The  position  of  an 
entry  in  one  of  these  vectors  is  its  encoded  value.  To 
process  either  one  of  these  types  of  information,  the  Table 
Generator  first  does  a  table  lookup  to  see  if  the  string  is 
in  the  appropriate  vector. 


If  a  keyword  is  not  present,  then  it  is  inserted  in  the 
keyword  vector;  the  position  of  the  entry  (whether  already 
present  or  new)  is  inserted  in  the  Syntax  Table. 

If  a  semantic  identifier  is  not  present,  an  error  is 
signalled;  all  these  identifiers  are  required  to  be  declared 
at  the  first  of  the  description.  If  it  is  present,  its 
position  in  the  vector  is  inserted  in  the  Syntax  Table. 


E.3  Branches  and  Labels 


The  branch  addressing  mechanism  is  explained  in  Chapter 
4.  This  section  describes  how  the  Table  Generator  handles 
branches.  It  is  possible  to  have  branches  forward  and 
backward  in  the  tables.  Branches  forward  occur  when  the 
branch  is  defined  before  the  label;  branches  backward  occur 
when  the  label  is  defined  before  the  branch. 


E.3.1  Data  Structures  for  Branching 

There  are  two  vectors  that  are  used  by  the  Table 
Generator  for  storing  information  about  branches  and  labels. 
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The  first,  called  Label  Stack  and  indexed  by  Label  Pointer, 


consists  of  records  of  three  fields. 


r 

|  Name  |  Address  J  First  Reference  | 

- - - j 


This 

vector 

is  reused  for  each 

rule  in 

the  user's 

description.  An 

entry  is  created 

in 

it  f  or 

each  label 

defined 

in  the 

rule.  The  Name 

is 

the  integer  used  to 

identify  the  label.  The  Address  is  an  index  into  Syntax 
Table,  to  the  label.  The  First  Reference  is  either  the 
constant  Last  Reference  (indicating  no  branches  to  this 
label  have  occurred  yet) ,  or  an  index  into  the  second  data 
structure,  called  Reference  Table. 

Every  time  a  branch  is  encountered  in  a  rule,  an  entry 
is  made  in  Reference  Table.  It  is  necessary  to  keep  track 
of  all  branches  and  resolve  them  at  the  end  of  the 
processing  for  a  whole  rule,  because  the  processing  of 
choices,  as  described  in  Section  E.5,  may  cause  shifting  of 
sections  of  the  emitted  Syntax  Table.  This  shifting  can 
cause  labelled  places  in  the  table  to  be  moved;  then  the 
branch  address  may  need  to  be  recalculated.  The  simplest 
method  of  dealing  with  this  was  to  record  all  branches,  and 
fix  them  all  at  the  end  of  a  rule. 

The  Reference  Table  consists  of  records  with  two  fields. 

r  '  “ 

|  Place  |  Next  Reference  | 

i _ _ _ 1 
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The  Place  is  the  index  of  the  branch.  Next  Reference  is 
either  the  constant  Last  Reference,  or  an  index  into 
Reference  Table.  All  sources  that  refer  to  a  common  target 
are  chained  together  by  the  Next  Reference  indices. 

Here  is  an  example  of  how  these  data  structures  are 
used.  Suppose  the  sequence  of  branches  and  labels  in  a  rule 
was:  Branch  2,  Label  1,  Eranch  3,  Label  3,  Branch  3,  Branch 
1.  Just  after  processing  the  'Branch  1',  the  vectors  would 
look  like  this: 


Address 

Name  First  Place  Next 

Reference  Reference 


1  1 

1  1 

1  I 

1 

1 

1 

1  1 

1  1 - 

1  1  P4 

1  1 

I  —  1 
t  X  | 

1 - 1 

1  3  | 

- 1  — 

A3  | 

I  i 

l—i  1  P3 

,  , - 

|  L->j  P2 

1  1 

'  1 - 1 

1  X  | 

1 - 1 

1  1  I 

- 1 - 

A1  |  X 

1 - 1 

1  3  | 

1  1 

1  2  | 
i _ 

Label 

XX  1 

Stack 

1  1 - 

1 - >1  PI 

-J  1 - - 

Reference 

1 - 1 

1  4  | 

Table 

E.3.2  Processing  for  Branching 


When  the  action  A  Branch  Point  is  recognized,  the  Table 
Generator  inserts  T  Branch  into  the  table,  followed  by  a 
zero  for  the  relative  address.  If  there  is  no  entry  in  Label 
Stack  for  the  associated  target,  then  one  is  made  with  an 
address  consisting  of  the  constant  Label  Not  Defined.  The 
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current  branch  is  then  inserted  in  the  (possibly  empty) 
chain  of  references  to  the  appropriate  label. 

On  recognition  of  the  action  A  Define  Point,  the 
function  Search  Label  Stack  is  invoked  and  it  returns  a 
pointer  to  the  record  for  the  appropriate  label,  if  present. 
This  record  has  its  address  field  updated  from  Label  Not 
Defined,  to  the  current  index  into  the  Syntax  Table.  If 
there  is  no  record,  indicating  that  there  have  been  no 
forward  branches  to  this  label,  then  an  entry  is  made  into 
Label  Stack,  with  Next  Reference  set  to  the  constant  Last 
Reference . 

At  the  end  of  each  rule  the  chain  of  references  for  each 
target  in  the  Label  Stack  is  read.  Each  branch  has  its 
appropriate  relative  address  computed  and  inserted.  The 
pointers  to  branches  are  updated  when  choice  handling 
(Section  E.5)  causes  shifting  of  sections  of  the  Syntax 
Table . 

E . 4  Calls  and  Names 

The  processing  of  calls  and  returns  is  similar  to  the 
processing  of  branches  and  labels;the  main  difference  is 
that  branches  are  all  local  to  one  rule,  while  calls  by 
definition  cross  from  one  rule  to  another.  Therefore  the 
information  for  calls  is  not  deleted  after  each  rule,  as  is 
done  for  branches  and  labels. 
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E.4.1  Data  Structures  for  Calls  and  Names 


There  are 
names  and  calls 
for  processing 
an  element  for 
processed  by 
of  five  fields. 


two  vectors  used  for  processing 
which  are  analogous  to  the  data 
labels  and  branches.  The  Call  Sta 
each  production  name  in  the 
the  Table  Generator.  Each  element 


product  ion 
structures 
ck  contains 
descript  ion 
is  a  record 


r  - - - 

| Length! String | Address | Referenced | First  Reference! 

i - - - —  - - - - - - — j 

The  String  is  a  vector  of  characters  (there  are  Length  of 
them) ,  containing  the  name  of  the  production.  The  Address 
is  an  index,  encoded  by  the  mechanism  described  in  Section 
4.6,  or  it  is  the  constant  Production  Address  Not  Defined. 
The  logical  variable  Referenced  is  used  to  check  that  each 
rule  (except  possibly  the  first,  or  goal  rule)  is  called  in 
the  description.  This  is  necessarily  explicitly  coded  for 
this  data  structure  as  chains  of  references  are  not  always 
retained,  as  is  explained  in  the  following.  First  Reference 
is  either  the  constant  Last  Reference,  or  else  it  is  an 
index  into  the  second  data  structure,  called  Call  Reference 
Table . 

This  is  a  vector  of  records,  each  containing  two  fields 
as  follows: 
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r  '  '  i 

|  Address  |  Next  Reference  | 

- - — - -  —  - - j 

The  Address  is  an  index  into  the  Syntax  Table  locating  a 
production  call.  Next  Reference  either  indexes  into  Call 
Reference  Table  to  the  next  call  to  the  same  production,  or 
it  is  the  constant  Last  Reference. 

E.4.2  Processing  Calls  and  Names 

On  recognition  of  the  action  A  Call,  the  token  T  Call 
Production  is  inserted,  and  it  must  be  followed  by  an 

address.  If  the  call  is  for  a  rule  that  has  already  been 

encountered,  then  its  address  is  in  the  Call  Stack,  and  is 
inserted  into  the  Syntax  Table.  If  the  rule  has  not  yet 

been  encountered,  then  an  entry  is  made  in  the  Call  Stack  if 
none  exists  for  the  rule  in  guestion,  and  the  call  is  added 
to  the  (possibly  empty)  chain  for  that  production. 

The  action  A  Rule  Name  causes  insertion  of  enough  A  No 
Action  signals  into  the  Syntax  Table  to  force  alignment  on  a 
multiple  of  Call  Production  Alignment.  Then  the  Call  Stack 
is  searched  for  an  already-existing  entry  for  this  rule 
name.  If  one  exists  and  has  a  defined  address,  then  a 

duplicate  rule  error  is  signalled.  But  if  no  address  has 
been  defined  then  it  is  the  case  that  calls  have  already 
been  recognized  for  this  rule.  Accordingly,  each  call  source 
is  fixed  with  the  address  of  the  next  entry  in  the  Syntax 
Table.  If  no  entry  exists  in  Call  Stack,  then  one  is  made. 
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E . 5  Choices 


When  a  choice  is  encountered  in  the  description,  we  do 
not  know  how  many  possibilities  will  be  associated  with  it. 
There  will  be  at  least  two,  but  there  could  be  any  number. 
The  Table  Generator  will  leave  room  for  two  addresses  and 
then  keep  track  of  the  addresses  cf  all  the  possibilities. 
If  there  are  more  than  two  possibilities  in  the  choice,  then 
the  code  emitted  for  the  choice  branches  will  be  shifted  up 
in  the  table  to  make  room  for  the  rest  of  the  addresses  when 
the  end  of  the  choice  is  reached. 

E.5.1  Data  Structures  for  Choices 

There  are  two  data  structures  associated  with  the 
processing  of  choices.  The  first  is  the  Choice  Stack,  which 
consists  of  records  of  two  fields. 

r - * - 

|  Address  |  Branches  Allocated  | 

i _ _ _ 1 

The  Address  is  an  index  to  the  choice  node  in  the  Syntax 
Table.  Branches  Allocated  counts  the  number  of 
possibilities  that  have  been  recognized  for  the  associated 
choice  node.  The  strict  nesting  of  choices  ensures  that  a 
stack  can  be  used  and  that  the  record  at  the  top  of  the 
stack  is  the  one  to  be  updated  when  a  new  branch  is  started. 
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The  second  data  structure  is  Choice  Branch  Stack  which 


is  a  vector  of  indices  into  the  Syntax  Table;  these  point  to 
the  various  possibilities  associated  with  choice  nodes.  The 
entries  for  the  deepest  nested  choice  currently  being 
analyzed  are  at  the  top  of  the  stack. 

E.5.2  Processing  Choices 

There  are  four  actions  which  cause  processing  related  to 
choices. 

The  action  A  Choice  is  the  signal  for  insertion  of  the 
token  T  Choice,  followed  by  two  zeroes,  into  the  Syntax 
Table.  Choices  must  have  at  least  two  possibilities,  hence 
we  emit  two  place  holders  (the  zeroes)  for  addresses.  In 
addition,  an  entry  is  made  in  Choice  Stack  saying  that  one 
branch  has  been  allocated,  and  the  address  following  the 
second  zero  is  pushed  onto  Choice  Branch  Stack. 

When  A  Next  Choice  is  recognized,  the  Choice  Stack  is 
updated  by  incrementing  the  field  Branches  Allocated  for  the 
current  choice.  This  signal  occurs  at  the  end  of  each 
possibility  but  the  last,  so  the  count  is  in  reality  always 
set  at  the  beginning  of  the  possibility  -  the  first 
possibility  starts  with  Branches  Allocated  set  to  one,  and 
the  end  of  each  possibility  except  the  last  one  causes 
incrementation  of  the  count. 

A  Otherwise  simply  causes  emission  of  T  Otherwise. 
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a  possibility 


When  A  End  Choose  is  recognized, 
consisting  only  of  the  signal  T  End  of  Choices  is  added 
unless  the  last  possibility  present  starts  with  T  Otherwise. 
Then,  if  more  than  two  possibilities  have  been  emitted,  all 
of  the  code  emitted  for  the  possibilities  must  be  moved  up 
in  the  Syntax  Table  in  order  to  make  room  for  the  rest  of 
the  addresses  which  must  be  inserted  following  the  choice 
node.  As  well  as  moving  the  code,  affected  branches  and 
calls  must  be  fixed.  The  Label  Stack  is  scanned,  and  if  a 
label  has  been  defined  in  the  portion  to  be  moved,  then  the 
Address  field  must  be  changed  accordingly.  As  well,  the 
chain  of  references  for  each  label  is  scanned,  and  any 
sources  of  branches  that  occur  in  the  affected  portion  must 
have  the  Place  field  of  Reference  Table  changed.  All 
unresolved  calls  must  have  their  pointers  in  Call  Reference 
Table  updated.  Finally,  the  relative  branch  addresses  are 
computed  and  inserted  following  the  choice  node  in  the 
table . 
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